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Abstract 

With the advent of generalized unitarity and parametric integration techniques, the construction of a generic 
Next-to-Leading Order Monte Carlo becomes feasible. Such a generator will entail the treatment of QCD 
color in the amplitudes. We extend the concept of color dressing to one-loop amplitudes, resulting in the 
formulation of an explicit algorithmic solution for the calculation of arbitrary scattering processes at Next- 
to-Leading order. The resulting algorithm is of exponential complexity, that is the numerical evaluation 
time of the virtual corrections grows by a constant multiplicative factor as the number of external partons is 
increased. To study the properties of the method, we calculate the virtual corrections to n-gluon scattering. 



1. Introduction 

Automated Leading Order (LO) generators [H, 0, H, 0, Q play an essential role in experimental analyses 
and phenomenology in general. However, the theoretical uncertainties associated with these generators are 
only understood qualitatively. The augmentation of the LO generators with Next-to-Leading Order (NLO) 
corrections will give a more quantitative understanding of the theoretical uncertainties. This is crucial for 
the realization of precision measurements at the Hadron colliders. By calculating NLO corrections using 
analytic generalized unitarity methods @, 0, Q , the one-loop amplitude is factorized into sums over products 
of on-shell tree- level amplitudes. This makes the integration of numerical generalized unitarity methods into 
the LO generators attractive. One can use the LO generator as the building block for obtaining the NLO 
correction, thereby negating the need for a separate generator of all the one-loop Feynman diagrams. The 
generalized unitarity approach reduces the complexity of the calculation through factorization. It can reduce 
the evaluation time with increasing number of external particles from faster than factorial growth to slower 
than factorial growth. 

By utilizing the parametric integration method of Ref. [9( significant progress has been made in the 
algorithmic implementation of generalized unitarity based one- loop generators 10, U| and other non- unitary 



methods [12] Q These implementations rely on the color decomposition of the amplitude into colorless, gauge 



invariant ordered amplitudes 
recursion relation algorithms 
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H9I ] . At tree-level these ordered amplitudes can be efficiently calculated by 
2Cj. These algorithms are of polynomial complexity and grow asymptotically 
as n 4 as the number of external partons, n, increases [2~lj |. By replacing the 4-gluon vertex by an effective 
3-gluon vertex the polynomial growth factor can be further reduced to n 3 [22|, [H, 0] . 

At the one-loop level the ordered amplitudes generalize into primitive amplitudes [25]. These primi- 
tive amplitudes reflect the more complicated dipole structure of one- loop amplitudes. While the analytic 
structure of the factorized one-loop amplitude in color factors and primitive amplitudes is systematic, the 
subsequent calculation of the color summed virtual corrections becomes unwieldy in the algorithmic imple- 
mentation [2^ ]. The reason for this is the rapid growth in the number of primitive amplitudes. This rapid 



1 These methods have matured to the point where explicit NLO parton generators for specific processes have been con- 
structed m, m, in, d, [i3 • 
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growth is mainly caused by the multiple quark-pairs amplitudes. A further complication arises from the 
possible presence of electro- weak particles in the ordered amplitudes. 

While in LO generators the analytic treatment of color is more manageable, alternatives were developed 
for high parton multiplicity scattering amplitudes 23, 27, 24[. Those alternatives provided a more numerical 
treatment of the color, thereby facilitating the construction of tree-level Monte Carlo programs for the 
automated generation of high multiplicity parton scattering amplitudes at LO. This was accomplished by 
not only choosing the external momenta and helicities, but also choosing the explicit colors of the external 
partons for each scattering event considered. In doing so, the tree-level partonic amplitude is a complex 
number and the absolute value squared is simply calculated. This numerical treatment can be done in 
the context of ordered amplitudes [28| by calculating the explicit color weights of each ordered amplitude. 
This method was generalized to one-loop calculations in R ef. J12I1. More directly, one can reformulate the 
recursion relations into color-dressed recursion relations 2^, 23, 24]. These color-dressed recursion relations 



integrate the now explicit color weights into the recursive formula. The resulting algorithm is of exponential 
complexity and grows asymptotically as 4™ for n-parton amplitudes; again, a reduction of the growth factor 
to 3™ can be achieved if the 4-gluon vertex is replaced by the effective 3-gluon vertex [24[ . 

In this paper we extend the generalized unitarity method of Ref. 10] as implemented in Ref. [30( to 
incorporate the color-dressing method. The algorithm is developed such that it can augment a dressed LO 
generator such as COMIX [B| to become a NLO generator^ For the numerical examples presented in this 
paper, we have used our own implementation of a color-dressed LO gluon recursion relation to calculate the 
virtual corrections for n-gluon scattering processes. 

The motivation for color dressing at the one-loop level is discussed in Sec. 2. We outline in Sec. 3 
the tree-level dressed recursion relations for generic theories expressed in terms of Feynman diagrams. We 
optimize the color-sampling performance and study the phase-space integration convergence for LO n-gluon 
scattering. The dressed formalism is extended to one-loop amplitudes in Sec. 4. The scaling with n, the 
accuracy of the algorithm and the color-sampling convergence of the virtual corrections to n-gluon scattering 
are studied in some detail. We summarize our results in Sec. 5. Finally, two appendices are added giving an 
explicit LO 6-quark example and details on the color-dressed implementation of the gluon recursion relation. 



2. Motivation for the Color-Dressed Generalized Unitarity Method 

So far the numerical implementations of generalized unitarity for the evaluation of one-loop amplitudes 
make use of color ordering: the ordered one-loop amplitudes are constructed from tree-level ordered ampli- 
tudes through the Z?-dimensional unitarity cuts. This has the advantage that the color is factorized off the 
loop calculation and attached subsequently to each ordered one-loop amplitude. For the pure gluon one-loop 
amplitude, this leads to a particularly simple decomposition in terms of the adjoint generators F of SU(./V): 

M (0 ' 1) (l,2...,n) ~ Tr(F ai F a2 ■ ■ ■F a ")m (0 ' 1) (l,2, . . . ,n) . (1) 

P(2,3,...,n) 



The decomposition is valid for both tree-level [18( and one-loop amplitudes [3l| . Once we can calculate the 
colorless ordered amplitude m(l, 2, . . . , n), all other ordered amplitudes are obtained by simple permuta- 
tions. All kinematic information about the n-gluon amplitude is encapsulated in a single ordered amplitude. 
However, we also see the drawback of this approach as we are interested in evaluating the amplitude squared. 
We have to calculate 2 . . . , n) x (M,^(l, 2 . . . , n)) summed over all color and spin states of the 

external gluons. This immediately leads to a factorial complexity when doing the multiplications of the full 
amplitudes as we have to sum over the permutations, P(2, 3, . . . , n), of the ordered amplitudes. Additionally, 
the color sum has to be performed either analytically or in some numerical manner. 

When including quark pairs the situation becomes even more complicated. The reason is that the internal 
structure of the one-loop amplitude is not uniquely defined by the external states, thereby affecting the color 



2 The LO matrix-element generator needs to be upgraded to allow for complex external momenta. 
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flow of the ordered amplitudes. As a result there exist many types of ordered amplitudes depending on the 
internal configuration of quark and gluon propagators. These amplitudes are called primitive amplitudes [25| 
and in general cannot be obtained from each other by simple permutations. For example, the one-loop qq + n 
gluon amplitude is given by [3ll ] 

M^(q;l,...,n;q) ~ 

n 

J2 J2 (T y T a i---T a *T x ) ij (F a ^---F a «) xy mW(q,l,...,k,q,k + l,...,n) . (2) 

k=2 P(l,...,n) 

where the T-matrices are the fundamental generators of SU(iV). While for the full amplitude a cut line has 
an undetermined flavor, each primitive amplitude has an unique flavor for all the cut lines. Therefore we 
can apply generalized unitarity to the primitive amplitudes. However, from a numerical/algorithmic point 
of view the evaluation of this equation becomes tedious as can be seen for instance in the calculation of the 
one-loop matrix elements for W + 5 partons in Ref. [26[ . 

It is clear that for an automated generator of one- loop corrections one would like to avoid ordered/primitive 
amplitudes altogether. For LO matrix elements, this can be done by applying the color-dressed recursion 
relations to evaluate the (unordered) tree-level amplitudes. From these color-dressed tree-level amplitudes 
we can build the one-loop color-dressed amplitudes by applying generalized unitarity, thereby circumventing 
the need for primitive amplitudes and explicit color summations. It is of interest to investigate the feasibility 
of this approach. The n-gluon scattering process is good for studying the behavior of the dressed algorithm. 
The color-ordered approach is most effective for n-gluon scattering. For processes with quark-pairs, the 
color-dressed approach will become even more efficient compared to the color-ordered approach. 

An additional advantage of the color-dressed algorithm is that it treats partons and color neutral particles 
on the same footing. Specifically, we can include electro-weak particles without altering the algorithm. This 
is in contrast to the color-ordered algorithm, where the addition of electro-weak particles would lead to 
significant modifications in the algorithmic implementation of the method. 



3. Dressed Recursive Techniques for Leading Order Amplitudes 



In tree-level generators the Monte Carlo sampling over the external color and helicity states has become 
a standard practice 23|, 27 , 24 1 . Such a color sampling allows for the efficient evaluation of large multiplicity 
partonic processes. A particular efficient implementation of the color-dressed Monte Carlo method uses the 
color-flow decomposition of the multi-parton amplitudes 2^, 3^, 24 1 . 

The principle of Monte Carlo sampling over the states of the external sources generalizes to any theory 
expressible through Feynman rules. By explicitly specifying the quantum numbers of the n external sources, 
one can evaluate the tree-level amplitude squared and differential cross section using Monte Carlo sampling: 



dcr LO U 1/2 



/3 ' ' ' fn) 



E 

T = \ 



dPS^iK.K, - K 3 ■ ■ ■ K n ) \MW (f[ r) ,4 r \ ■ ■ 



(3) 



where 

f« = {fi,h fi ,C fi ,K fi }^ (4) 

denotes the flavor, spin, color and momentum four-vector of external state i for event rjf] The constant 
Ws contains the appropriate identical particle factors and Monte Carlo sampling weights. For each event r, 
the external states are stochastically chosen such that when summed over many events we approximate the 
correct differential cross section with sufficient accuracy. 



3 We will use flavor to indicate the particle type, such as e.g. gluon, up-quark, W-boson, etc. 
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3.1. The Generic Recursive Formalism 

To calculate the tree-level amp litude M {0) in Eq. ©, we follow the method of color-dressed recursion 
relations as detailed in Refs. [Hi HI]. A recursion relation builds multi-particle currents from other currents. 
The m-particle current J s (f w ) has m on-shell particles f T = {fijie^ = {f^, . . . ,fj m } where ir = . . . ,i m } 
and one off-shell particle g = {g, L g , C g , K g } with g, L gi C g and K g denoting the flavor, Lorentz label, color 
and four- momentum, respectively. The momentum of the off-shell particle, K g , is constrained by momentum 
conservation: K g = -K„ = - J2ien K i- 

The dressed recursion relation generates currents using the propagators and interaction vertices of the 
theory. Using standard tensor notation we can write the propagators as 

P S1S2 (Q) = S gig2 6 CgiCg2 P L ^(Q) , 

gi 

p^j,^)] - J2 J M pslS2 ( K ^) J M - ( 5 ) 

glg2 

where e.g. the gluon propagator is given by P^ lfl2 (Q) — —g^^'/Q 2 . Note that the particle sums are 
taken over all quantum numbers of the off-shell particles g^. Furthermore, in all expressions momentum 
conservation is always implicitly understood. The on-shell tree-level n-particle amplitude can hence be 
expressed in terms of an (n — l)-current, 

M m (fi,...,f n ) = P- 1 [J(fi,...,f„_i), J(f„)] . (6) 

We denote the interaction vertices of the theory as V Sl ... Sk (Qi, . . . , Qk). The maximal number of legs for 
the allowed vertices of the theory is denoted by Vmax- The number of legs of the vertex is indicated by the 
number of its arguments and the type of vertex is specified by the quantum numbers of the legs. The labels 
gi , . . . , gfc run over the values of all particles of the theory. Symmetries and renormalizability imply that 
many of the vertices are set to zero. The theory is defined by its particle content and its non-vanishing 
vertices, which are generalized tensors: 

v sl ... Sk (Qi, ...,<?*) = v g L ^S h 91 ... Cgk (Qi, • ■ ■ . Qk) ■ (7) 

The sum of all vertices contracted in with currents constitutes the main building block of the recursion 
relation. We define it as 

D s [j(^),...,J% k )] = Yl V SSl ... Sh (K g = -Kn k ,K^,...,K nk )xJ^({ 7 , 1 )x...xJ^(^ k ), (8) 

gi-gfc 

where the inclusive list Ilfc is build up of unions of the exclusive lists: 

fc 

n fc = |J n t . (9) 

Fig. [1] is a pictorial representation of Eq. (jHJ) when using the example of QCD. For this case, we will work 
out the generic vertex blob in detail in the next subsection. 

The recursion relations terminate with the one-particle currents. A one-particle g-current is defined in 
terms of the source S^ f [ Cfi (Kf t ). Hence, we have 

j s (n) = s"*s°' a « (to) 

For example, the gi-gluon one-particle source with helicity Ai, color c\ and momentum K\ is given by 
"-Tg(gi) — ^ CCl£ ^(^i)- I- e - the gi-gluon source is a matrix in color space multiplied by the helicity vector. 
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^giuon(vri,vr 2 ) = H515~6 




"1 



7To 



Figure 1: A graphical representation of Eq. (JSJ for k = 2 and an off-shell gluon in QCD. Because of flavor conservation only 
one of the two vertices can contribute for any given partition. 



The n-particle currents are now efficiently calculated from a recursively defined current in the following 
manner: 

V m ax-1 Sa(n,k) 



j s (h,...,f n ) = J2 E 



D[ J(f Wl ),..., J(U J] 



(11) 



k=2 P„ 



= (l,..,n) 



where S2(n,k) is the Stirling number of the second kind. The first recursive step is graphically illustrated 
in Fig. @ for the example of J g (u, d, s, s, W~). The sum over P 7!l ... 7rk (l, . . . ,n) generates all different 
partitions decomposing the set {1, . . . , n} into the non-empty subsets %x, . . . ,7Tfe. An example for a list of 
different partitions is 

P™ 3 (1,2,3,4) = {7T«7T«4 4) }^ 4 ' 3) " 6 

= { {{1,2}{3}{4}}, {{1,3}{2}{4}}, {{1,4}{2}{3}}, 

{{2,3}{1}{4}}, {{2,4}{1}{3}}, {{3,4}{1}{2}} } . (12) 

The formalism described here fully specifies an automated algorithm of exponential complexity to cal- 
culate the LO differential cross-sections for any theory defined in terms of Feynman rules. Owing to the 
characteristics of the partitioning, the computer resources needed to calculate the n-particle tree-level am- 
plitudes asymptotically grow in proportion to 52 (n, Vrnax)- The exponential behavior arises from the large-n 
limit of the Stirling numbers, i.e. 52 (n, Knax) — * V£ aK [22]. It may be possible to reduce V max by rewriting 
higher multiplicity vertices as sums of lower multiplicity vertices thereby improving the efficiency of the 
recursive algorithm [23, 24 1. For the case of the Standard model this has been fully worked out in Ref. 0] 
and implemented in the COMIX LO generator. 



s 




Figure 2: The first recursion step for the unordered gluon current with u, d, s, s quarks and a W~ gauge boson in the final 
state. There are 15 contributions corresponding to all possible partitions of the final-state particles into two groups. Because 
of flavor conservation there are only 4 non-vanishing contributions for the "4+1" partitions (first term) and 2 non-vanishing 
contributions for the "3+2" partitions (second term). 
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3.2. Multi-Jet Scattering Amplitudes 

We specify the generic recursion relations to the perturbative QCD Feynman rules. This will give an 
algorithmic description of the scattering amplitudes at LO for multi-jet production at hadron colliders. 

The external sources are gluons and massless quarks. All these particles have color and helicity as 
quantum numbers. Instead of the traditional color representation in terms of fundamental generators, we 
choose the color-flow representation 

100, which is more pertinent to Monte Carlo sampling and 
easily derivable from the traditional color representation by making the following two observation: first, any 

internal propagating gluon has as a color factor 5 ab — Tr( T a T b ) Q This color factor can be rewritten as 



S ab Tv(T a T b ) 1 

M = A -^ B " = A K R2 ' B b = Aij j^B Jl . (13) 

Second, we contract the amplitude with ZJ?*. for each external gluon: 

\M\ 2 = M a 5 ab {M b y = M a T^ b l {M b )^ = MyM^ . (14) 



with 



From these observations it follows that we can calculate the interaction vertices in the color-flow represen- 
tation by simply contracting each gluon with T®*, and summing over a^. The three gluon vertex is thus 
given by 

V S1S2S3 {K U K 2 , K 3 ) = V$£Ew. ( K i> K *> K a) 

_ j,ai jras 2 jia 3 y/*i|*2A»sf If fr \ 
H3l !2J2 v aia 2 a 3 K^ll^-il^d) 

= Kh T tX T iX / aia2Q3 V2% m3 (Ki,K 2 , K 3 ) 

= {^^l-^^l)% m3 {Ki,K 2 ,K 3 ) , (15) 

V^^( Kl ,K 2 ,K 3 ) = ^=((Ki ~ K 2 r*g^ + (K 2 - K 3 )^g^ + (K 3 - K^g^) . (16) 
Similarly, for the four gluon vertex we find 

T/ _ T///1//2//3//4 _ \ 1 f Hi r«2 «3 A* 4 J- A* 1 A* 2 A*3 A* 4> \ T// 1 !/ 1 ^ M2/14 f-\ n\ 

^glg2g 3 g4 - V lljll 2 j2l3] 3 t4H ~ I J8 J8 34 il 34 Jl 3J 38/ 4 1 \ L ' J 

C(234) 

with 

yVlH2, //3M4 _ 2 g^ 1 ^ 2 g^ 3 ^ i — g^l^ g^2H4 _ gillVigH2H3 ^8) 

and the sum is over the cyclic permutation of the indices {2,3,4}. In the color-flow representation the 
quark-antiquark-gluon vertex is given by 

^qgq = Vlt^j = (SijJhJ ~ J^^O ^ > (19) 



with 



The external sources are given by 



^- = ^=7^- (20) 



J g ( gl ) = 

J q (qi) = , 

7q(qi) = i*^^) , (21) 



4 Because of this normalization, the structure constants f abc are a factor of \/2 larger than in the conventional definition. 
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where g = {g, \i, (/</), -Kx}, gi = {fifi, Ai, (hji), Kx}, q = {q, s, I, -Kx}, qi = {qx, Ai, ii, Kx}, q 
{g, s, J, —Kx} and qi = {gi, Xx,jx, Kx}- The internal propagating particles are given by 



P glS2 (Q) 


= 




P qiq2 (Q) 
P qiq2 (Q) 


= %w 

= 


— Tin, 

+ run, ) - - 



(22) 



with g fe = {g kl ^ k , (ikjk),Q}, q/c = {<7fc,s/c,U,Q} and q k = {q k ,s k ,j k ,Q}. 

We can now construct Berends-Giele recursion relations (2p| using color-dressed multi-parton currents 
based on Eq. (jTTJ) . The result is 

j q (f l5 ...,f n ) = ]T p^dIA^A^)] 

P^ 2 (X,...,n) 

J s (fx,...,f n ) = P^DiJ^J^)] 

+ Yl P s [D[J(f n ),J(f„ 2 ),J(f V3 )]] , (23) 

P^ 2 „ 3 (l,...,n) 

where each current violating flavor conservation is defined to give zero. The compact operator language can 
be expanded out to an explicit formula by adding back in the particle attributes. For example, 



qgiq 

E P «i (^n 2 ) V"*"* i-K^ , K W1 , K n ) J g2 (f OT1 ) J g3 (f„ a ) 



glg2g3 
1 



+ T^z™^ (-^u^ , ^ , x J(f 2 (f ffl ) x Jfe>3 (f ff2 ) . (24) 

The n-parton tree- level matrix element is calculated using Eq. ^ . We exemplify in appendix El how to 
work out the 6-quark recursion steps using the above formalism. 



3.3. Numerical Implementation of n-gluon Scattering 

The method of color dressing as discussed in this section relies on the ability to perform a Monte Carlo 
sampling over the degrees of freedom of the external sources. In this subsection we will study in some 
detail the properties of such a sampling approach by means of the color-dressed gluonic recursion relation. 
We are particularly interested in the accuracy of the color-sampling procedure and overall speed of the 
implementation. The addition of quarks and external vector bosons is a straightforward extension and will 
not affect the conclusions reached in this subsection. 

The explicit color-dressed gluon recursion algorithm is given in terms of colored gluonic currents. The 
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gluonic currents are 3x3 matrices in color space and defined as 



Jg (gm) 
Jg (§1 ) • • • J gm) 



E 



P g [£>[J( g7ri ),J( g7r2 )] 

rn) 

^ p g [r>[j( gffl ), J(g ff2 ),J(g W3 )] 



P„ 1 „ 2 ^ 3 (l,...,m) 
The color-dressed n-gluon amplitude is given by 

M( )( gl ,g2,... jg n) = [J(gl,g2,...,gn-l),j(gn)] 



(25) 



(26) 



For this specific example, we have labelled the on-shell gluons by gj, the off-shell gluon is denoted by g 
as before. The operator formulation of the recursive algorithm is particularly suited for an object oriented 
implementation of the recursive algorithm. We have implemented the algorithm presented above in C++. 
More details including the more explicit recursion equation are shown in appendix [Bj 

The first issue to deal with is the correctness of the implemented algorithm. To this end we want to 
compare the color-dressed amplitude to existing evaluations of the gluonic amplitudes based on ordered 
amplitudes. To facilitate the comparison, we write the color-ordered expansion of the amplitude using the 
color-flow representation [32j : 



M<°>(gl,g2,...,gn) 



= e ^tiist,-,^) 

P(2,...,n) 

= E Tr(F^...F a ")mW(^,..., ffl >) 

P(2,...,n) 



(27) 



P(2,...,n) 



E ^••■^r 1 ^- (0) (^,...,5^") 

P(2,...,n) 



g^") are ordered amplitudes with the property m^°\l, 2, 



i) = (-l)"m(°i(n 



,2,1) 



The m(°)(^ r 

-By choosing the explicit momentum, helicity 
and color (ij) m of each gluon we can compare the numerical values of Eqs. (|2"6"j) and (|2T|) . We have done 
the comparison up to 2 — * 12 gluon amplitudes and found complete agreement, thereby validating the 
correctness of the color-dressed algorithm. 

An important consideration in calculating the color-dressed amplitudes is the color-sampling method 
used in the Monte Carlo program. For a 2 — * n — 2 gluon scattering amplitude, each of the gluon color states 
is stochastically chosen. The full color configuration of the event is expressed by {(ij) m }m=i wnere 4 and 
j m each denote a color state out of three possible ones that can be labelled {1,2,3}. In the "Naive" approach 
one samples uniformly over all possible color states of the gluons. The number of color configurations, N^ a f lve , 
and the color-configuration weight, W c ^f lvo , are given by 



ArNaivc 
JV col 



9" 



and 



w c N Q r c = i 



(28) 



(29) 



respectively. About 95% of the naive color configurations have a vanishing color factor. This results in a 
rather inefficient Monte Carlo procedure when sampling over the color states. As was noted in Ref. 24[, 



Scattering 


Naive 


Conserved 


Non-Zero 


2^2 


6,561 


639 


378 


2^3 


59,049 


4,653 


3,180 


2^4 


531,441 


35,169 


27,240 


2^5 


4,782,969 


272,835 


231,672 


2^6 


43,046,721 


2,157,759 


1,949,178 


2^7 


387,420,489 


17,319,837 


16,279,212 


2^8 


3,486,784,401 


140,668,065 


135,526,716 



Table 1: The number of color configurations sampled over when using the different Monte Carlo color schemes. 



a significant number of the zero color-weight configurations can be removed by imposing color conserva- 
tion. This is implemented by vetoing any color configuration for which the condition 3c G {1,2,3} : 
2m=i (^m>c ~ <5j m ,c) 7^ is true. In other words, the non-vetoed color configurations can be obtained by 
uniformly choosing the colors i\, . . . ,i n and subsequently generating the colors ji, ■ ■ ■ ,j n through a permu- 
tation of the list . . . ,i n }- For the number of color configurations to be sampled over, this approach, 
which we name "Conserved" , then yields 

n / | \ 2 

AS™ 1 = E <Wn 2+ n 3 ,n , , (30) 

ni, 712,^3—0 x ' 

where n c = , $i m ,c- As this way of sampling is no longer uniform, each generated color configuration 
gets an associated color weight described by 



^Conserve. = ^ ^ 

ni! n 2 ! n 3 ! 



Yet, there still are non-contributing color configurations left in the sampling set. We have to augment the 
selection criteria further by vetoing any color configuration for which the condition 3 c G {1,2,3} : [ Vm G 
{1, 2, . . . , n} : (i m = c — > i m = j rn ) ] is truej^l In other words, we veto a color configuration if all occurrences 
of a particular color c come paired: i m = j rn = c. By adding this veto to the "Conserved" generation, 
we obtain the "Non-Zero" Monte Carlo procedure that has removed all color configurations with zero color 
weight. The number of leftover configurations sampled over is given by 

ArNon-Zero _ sr I n - 

Jv col — / , °n 1 +n 2 +n 3 ,n I III 

ni,n2,n3— x 

I n\- m\n 2 \ n 3 l [l - J2 C ®( n c - 1)] - J2 C ©( n c - 1) n c \ (n - n c )\ 
I ni!n 2 !ns! 



(32) 



where the step function Q(x) — 1 for x > and zero otherwise. The weight associated with each sampled 
color configuration has to be modified and reads 



/ Non-Zcro = /g„ _ 3) [ nl ~ "l ! "2? n 3 \ [l - J2c Q K ~ 1)] ~ Ec ~ 1) (n - fl c )\ 



For up to 10-gluon scatterings, Table Q] displays the resulting number of sampled color configurations in the 
column indicated "Non-Zero". It is also shown how this number compares to the numbers found for the 
"Conserved" and "Naive" sampling scheme. 



5 When all colors are identical, i.e. ii = ji = £3 = jo. = ■ ■ ■ = i n = j n , every color factor in Eg. (127ft is equal to one. We can 
still veto the event because the sum over all ordered amplitudes is identical to zero at tree level [2C|l . 
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Scattering 


color ordered 


color dressed 


color dressed 










(Vmax = 4) 


(Vmax 


= 3) 


2 -> 


2 


0.0313 




0.117 


0.083 




2 -> 


3 


0.169 


(5.40) 


0.495 (4 24) 


0.327 


(3.93) 


2 -» 


4 


0.791 


(4.68) 


1.556 (3 ' 14) 


0.822 


(2.51) 


2 -> 


5 


3.706 


(4.69) 


c i i (3.93) 

6.11 


2.66 


(3.23) 


2 -> 


6 


17.83 


(4.81) 


25.26 (4 ' 13) 


7.55 


(2.84) 


2 -> 


7 


99.79 


(5.60) 


93.43 (3 70) 


24.9 


(3.30) 


2 -> 


8 


557.9 


(5.59) 


392.4 < 4 ' 20 > 


76.1 


(3.05) 


2 -> 


9 


2,979 


(5.34) 


1,528 (3 ' 89) 


228 


(2.99) 


2 -> 


10 


19,506 


(6.55) 


5,996 (3 ' 92) 


693 


(3.04) 


2 -> 


11 


118,635 


(6.08) 


24,821 (4 ' 14) 






2 -> 


15 






6,248,300 (3 984) 







Table 2: The time (in seconds) to evaluate 10,000 color-dressed tree-level amplitudes for 2 — > n — 2 gluon scatterings. Only 
color configurations with non-zero weight are taken into account. Also indicated is the growth factor (given in brackets) with 
increasing n. To compute the amplitudes a 2.20 GHz Intel Core2 Duo processor was used. 



Next we examine the execution time of n-gluon scattering amplitudes using the "Non-Zero" color sam- 
pling. In Tabled the CPU time needed to calculate the color-dressed amplitudes according to Eq. ([25]) and 
Eq. ((27)) are compared. 

The evaluation of Eq. (|27[) employs the ordered recursion relation [2(| . Naively one would expect this 
evaluation to grow factorially with the number of gluons. However this growth is considerably dampened 
by sampling over non-zero color configurations only. Note that for a given event we calculate each ordered 
amplitude with non-vanishing color factor independently of the other ordered amplitudes. One can speed 
up the computation time by sharing the calculated sub-currents between different orderings. This, however, 
is outside the scope of this paper. 

For the evaluation of Eq. ([26)) we use the color-dressed recursion relation of Eq. ([25]) . To study its time 
behavior we apply this recursion as discussed in appendix [B] with and without the 4-gluon vertex. As can 
be seen from Table [21 the required CPU times scales as 4™ or 3™ if the 4- gluo n vertex is neglected. This 



exponential scaling was derived in Ref. [22], |24( . The derivation, following [2J| , uses the recursive buildup 
of the amplitude. To calculate an n-particle amplitude using a U-point vertex, we have to evaluate the 
(n — l)-particle current of Eq. ([6]). This current in turn is determined by calculating all ("^ ) m-particle 
sub-currents, where n — 1 > m > 2. Each m-current is constructed from smaller currents using Eq. pip 
thereby employing the U-point vertex. All possible partitions into V — 1 sub-currents are given by the 
Stirling number of the second kind, 52 (m, V — 1). This leads to the following scaling of the calculation of 
the n-particle amplitude 

n-l , _ -I \ 

T - = E \ ^(m, V - 1) = 5 a (n, V) ~ V n . (34) 



Consequently, the n-gluon amplitude using the standard 3-gluon and 4-gluon vertex has an exponential 
scaling behavior T n — > 4™. This is evident from the results shown in Tabled As can also be seen in the 
table, the scaling behaves as expected when the 4-gluon vertex is left out, i.e. T n — > 3". As was shown in 



Ref. [23], [24j , the 4-gluon vertex can be avoided and replaced by an effective 3-point vertex. This results in 
a significant time gain for the evaluation of high multiplicity gluon scattering amplitudes. 

An important consideration in the usefulness of the color-sampling approach is the convergence to the 
correct answer as a function of the Monte Carlo sampling size Amc- To this end, we compare the color- 
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sampled result S^' c for the tree-level amplitude squared, 



S$c,r = W col ( ni ,n 2 ,n 3 ) x M'^gf 1 ,...,^) , (35) 



to the color-summed, i.e. color-exact, result 

3 3 



2 



S Z= E E |^ (0) (gl r) >--->gi r) ) • (36) 

ii,...,i„=l ii,...,J n =l 

We plot the ratio of the average value for the color-sampled amplitude squared and its standard deviation 
over the average value of the color-summed amplitude squared as a function of the number of evaluated 
Monte Carlo events: 

/c(°) \ _1_ 



We define the ratio this way so that most of the phase-space integration fluctuations are divided out. The 
average values are computed via 

< 5(0) > = AT" E 08) 

r— 1 

where the index r numbers the different events with the only exception that the gluon polarizations have 
held fixed: Ai, . . . , A„ = H — . . . H — (+). The standard deviation of the average is calculated by 

/Ef"i(^) 2 ^Mc(5M> 2 

a < s(0) > = 5wH • (39) 

The 4- and 6-gluon scattering results are shown in the respective top parts of Figs. [3] and 2] for the three 
different sampling methods "Naive" , "Conserved" and "Non-Zero" . The generated phase-space points were 
subject to the constraints: p± : „ t > O.ly^, |?7 TO | < 2 and AR m i > 0.7, see also Eq. (|55l) . As it can be seen 
from the two plots by avoiding sampling over zero-weight color configurations the convergence is greatly 
enhanced. 

For Nmc — O(10 5 ), we obtain sufficient accuracy in the "Non-Zero" sampling method. To illustrate this 
more clearly, we show in the lower panels of Figs. [3] and 0] as well as in Fig. the number of Monte Carlo 
events needed to achieve a certain relative precision in the color sampling. For these plots, we generate 
Nevent events, which are partitioned into trial and sampling events via N cvcat = iV tr i a i x Nmc- We define as 
a function of Nmc the ratio 

^N MC o(0) , „(o) w M \ 

iWiVMc) = HLIT = if ( 4 °) 

Er=l ^coLr \ S coi ) (^Mc) 



and plot Nmc versus the relative precision ct(Rmg) / I^(Rmc) ■ The mean value 

Atrial 



J * trial 

KRmc) = ^— J2 R MC,ki N Mc) (41) 

and the standard deviation 



Vtrial fe=1 



( „ , J Zk=i l (RmcANmc)) 2 - N tl , al ^(R MC ) .... 
a(i?MG) = V • (42) 

are computed by using a sufficiently large number of trials, i.e. iVtrfai estimates of -Rmc(-^mc) & r e calculated 
to obtain the mean value and the standard deviation for Rmc- For Atrial > 0(100), we get rather smooth 
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gg -> 2g (+-+-) 




Number of points N, 



gg -> 2g (+-+-) 

ft; \, i 1 i 1 




100% x *(Rmc(N U c)) I ' fi(Ruc(N M c)) 



Figure 3: Top panel: comparison of Monte Carlo integrations for the various color-sampling schemes, including the standard 
deviation, to the exact color-summed result as a function of the number of evaluated phase-space points. One obtains 1.0034 ± 
0.0091, 0.9989±0.0027 and 0.9999±0.0022 after 10 7 steps for the "Naive", "Conserved" and "Non-Zero" sampling, respectively. 
Bottom panel: number of events required to reach a given relative accuracy on the numerical evaluation of the color-sampled 
amplitude. For the definition of Amc(-^mc) an d the values of the fit parameters determining the dashed curves, cf. the text. 
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gg -> 4g (+-+-+-) 





Naive 






Conserved 






Non-Zero 






10 10" 
Number of points A' 



gg -> 4g (+-+-+-) 



10 



lo- 



10" 



10" 



10 



10 




10 100 

100% x ct(r uc {Nmc)) / v(Rmc(Nmc) 



1000 



Figure 4: Top panel: comparison of Monte Carlo integrations for the various color-sampling schemes, including the standard 
deviation, to the exact color-summed result as a function of the number of evaluated phase-space points. One obtains 1.16±0.32, 
0.995 ± 0.071 and 0.913 ± 0.037 after 10 5 steps for the "Naive", "Conserved" and "Non-Zero" sampling, respectively. Bottom 
panel: number of events required to reach a given relative accuracy on the numerical evaluation of the color-sampled amplitude. 
For the definition of Rmc(N mc ), cf. the text. The fit curves in terms of a/fi(N MC ) are described by 14.0 AC„ , 2.84 iV~°" ' 1 



and 3.10 N~ 



for the "Naive", "Conserved" and "Non-Zero" sampling, respectively. 



"MC 

The "Conserved" and 



'MC 
'Non-Zero" 



approaches are slower by factors of / = 10.5 and / = 13.3, respectively (see text). 
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gg -> 6g (+-+-+-+-) 

io 3 n "\ — ^ 1 — 1 — 1 — 1 — r ~n 1 




100% X 0-(fl M c(iVMc)) /A*(i?Mc(iVMc)) 



Figure 5: Number of events required to reach a given relative accuracy on the numerical evaluation of the color-sampled 
amplitude. For the definition of -RmcC-^mcOj cf. the text. The fit curves in terms of c//i(A r Mc) are described by 8.04 30 , 
3.25 -%yic 40j an< ^ 3.01 344 for the "Naive", "Conserved" and "Non-Zero" sampling, respectively. The "Conserved" and 
"Non-Zero" approaches are slower by factors of / = 9.6 and / = 10.8, respectively (see text). 



curves. In the 4-gluon case shown in the lower part of Fig.[3]this gives a reasonable description for Nmc < 10 5 . 
The 6- and 8-gluon scatterings are more involved and require more statistics. The trend however can be 
read off the respective plots in Figs. [Hand [5] 

For sufficiently large Nmc, the expected scaling of the relative standard deviation a with the number of 
Monte Carlo events is (t(Rmc) ~ As can be seen from the second plot of Fig. [3] the scaling is as 

expected and we can fit to the functional form A x N^c- In the 4-gluon case, we find 

O-(-Rmc) oq o ^ at-0.529 



Naive : " = 33.8 x N, 



MC) 



MC 



Conserved : = 6 - 45 x JV M °' 487 

M (Rmc ) 

Non _ Zero : - 4 - 35 x ^mc 484 ■ (43) 

From these fits we can quantify the enhancements owing to the sampling strategies. The "Conserved" 
sampling method improves over the "Naive" method by a factor of 33.8/6.45 = 5.2, while the improvement 
of the "Non-Zero" method over the "Conserved" method yields an additional factor of 6.45/4.35 = 1.5 (or a 
factor of 33.8/4.35 = 7.8 over the "Naive" method). The algorithm determines the color configurations with 
vanishing color factor before it fully evaluates the corresponding matrix-element weight. The differences 
between the various sampling methods therefore become smaller when we measure the computer evaluation 
time to reach a certain relative precision. When we express this in numbers for the example of 4-gluon 
scattering, we notice that the "Conserved" and "Non- Zero" sampling schemes are slower by factors of / = 
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2.42 and / = 3.29, respectively. This translates into changing the fit parameter A — > A' — Af B . The 
corresponding ratios then read 33.8/9.92 = 3.4 and 9.92/7.74 = 1.3 when specifying the improvement of the 
"Conserved" versus the "Naive" and the "Non-Zero" versus the "Conserved" method, respectively. We see 
using improved sampling over color configurations is still highly preferred. 

4. Dressed Generalized Unitarity for Virtual Corrections 

By using the parametric integration method of Ref. 0] one can implement the generalized unitarity 
method of Ref. into an efficient algorithmic solution [33|]. For the evaluation of color-ordered amplitudes, 



the algorithm is of polynomial complexity 2lJ . To calculate the dimensional regulated one-loop amplitude 
we extend the parametric expressions to D-dimensions and apply the cuts in several integer dimensions 
to determine all the parametric coefficients [10] The algorithm is equally applicable for the inclusion of 
massive quarks [13] ■ The power of this algorithmic solution was demonstrated in Refs. [35, 36|, 3(j for pure 
gluonic scattering. 

Given the fully specified external sources and the interaction vertices, both real and virtual corrections 
can be evaluated by the recursive formulas. The virtual corrections to the differential cross section are given 
by 

Try- Advent 

d<r {v \hh^ h---U) = tt^x dPS^(K 1 K 2 ^K 3 ---K n ) 

^* event n 
r— 1 

2 5R (m® (f{ r) , • ■ x M(1) ( f i W ' ■ • -.tfO) . ( 44 ) 

where the external sources, including momenta and quantum numbers, are sampled through a Monte Carlo 
procedure. The weight Wg is determined by process dependent symmetry factors and sampling weights. 

In this section we show how to use the color-dressed tree-level amplitudes discussed in the previous 
section to construct the color-dressed one-loop amplitudes. By color sampling over the external partons one 
can calculate the virtual corrections using Eq. (T4"4"|) . The generic algorithm will be outlined and applied to 
pure gluon scattering. 

4-1. Generic Color-Dressed Generalized Unitarity 

The one-loop amplitude M^- 1 ' (fi, . . . , f n ) is obtained by integrating the un-integrated amplitude denoted 
by A^ (fi, . . . , f„ | tj over the loop momentum I: 

M^(ft,...,f n ) = J -^A^(h,...,f n \t) . (45) 

The integrand function can be decomposed into a sum of a finite number of rational functions of the loop 
momentum with loop independent coefficients Q . The coefficients can be calculated in terms of tree-level 
amplitudes. 

The parametric form of the integrand is given by the triple sum of rational functions, 

C max max(l,i(fe-l)!) S 2 {n,k) T>, ( C \ P\ 

,,, x-^ x-^ ' * l^jnr-Snt I * ) 

A^(fr,...,i n \£) = J2 E E ~ d nh ffl-./ffl ' ( 46 ) 

k=i RP^...^ k (i,2,...,n) am ....am* m * K) a ^ l) 

where the sum over the propagator flavors grii , ■ ■ • , gn k is required as these are not uniquely defined for 
unordered amplitudes. 



6 If one uses an analytical implementation of the D-dimensional unitarity method of Ref. [id . one can eliminate the penta- 
cuts pS4| . However, in numerical implementations the removal of the penta-cuts requires performing a numerical contour 
integral in the complex plane [ll|l . 
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Figure 6: Graphical representation of a quadrupole-cut partitioning of the external legs into an ordered set of four unordered 
subsets 7Ti , 7T2 , 7T3 , 7T4 of external particles. The corresponding tree-level diagrams are connected with the propagators of particle 
9ni , 9n 2 > 9n 3 , 9n 4 ■ 



The maximum number of denominators needed to describe the dimensional regulated one-loop matrix 
element is C max - The value of C max is given by the dimensionality of the loop momentum. For the one-loop 
calculations in dimensional regularization the maximum dimension of the loop momentum is equal to five, 
i.e. C max = 5. The denominator terms are defined as 



(47) 



with IT m given through Eq. (J9]). The partition sum is over RP ni ...„ k (l, 2, . . . , n) (D P 7ri ... lrs .(l, 2, . . . , n)) 
elements. The total number of elements is given by max (l, \{k — 1)!) x ^(n, k). This extended partition 
list now also includes non-cyclic and non-reflective permutations over the regular partition lists {{7Ti}i = i}; 
more specifically we have: 



RP* 



RP^i 



RR 



77 1 7T2 773 774 775 



{p \ 

/ P P P 

fa 

P P P P 

± tti Tr2Tr47r57r3 7 * Tri7r4 7r5Tr3 7r2 7 J 7riTr5 7r37T2Tr4 7 J 7ri7T37r27r4Tr5 7 

p p p p \ 

- 1 7Ti 7T2 7T5 7T3 7T4 7 7Ti 77 5 77 3 7T.4 7T2 " 771773 774 772 775 5 77i 774 772 77", 773 | 



P P P 

772713774775 5- 1 711773774775772;-'- 771774775712773 l J 771775772773774 1 



(48) 



The polynomial dependence of the numerator functions Vk on the loop momentum is specified with a 
vector of parametric coefficients C gni ... gn . The explicit polynomial forms that we are using are given in 

Rcf. [10]. The dimensionality of the parameter vector C gni ... gn depends on the number of denominators. 
In the case of 5 denominators there is only one parameter, for the terms with 4 denominators we have five 
parameters, etc. . The parameters are determined by putting sets of denominators to zero and calculating 
the residue in terms of tree-level amplitudes. Setting denominator factors to zero is on a par with cutting the 
corresponding propagators as required by generalized D-dimensional unitarity. Let £n 1 ---n a be the "on-shell" 
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loop momentum fulfilling the "unitarity condition" : 

d gni (£ ni -n c ) =■■■= d gnc (£ ni -n c ) =0; c = 2, . . . , C max . (49) 

To fulfill the unitarity conditions we allow also complex values for the components of the loop momenta. 
The parametric form of the numerator functions for c-cuts becomes 

^c(4 ni ... snc \t Ul ...n a ) = Res 9ni ... 9nc (^l (1) (f 1 ,...,f„Kn 1 .-nj) 

- E E *nn'-An E " ~ ~ ^ • ( 50 ) 

ninj n c n c , ~ 1 

m= c +i pp Sli ... iSm (i,...,n) 9ft c+1 -sfi m %+! 1 n r -n c i a 9fi m V*rii • • -11^ 

where the sum PP^ 7f m (lj • ■ ■ , n) over all ml permutations of the m partitions is supplemented with the 
(5-functions to generate the appropriate subtraction functions. Each individual subtraction expression has 
to be evaluated with the appropriate shift of the loop-momentum, „g ■ This equation provides us with 
an iterative procedure starting with the highest number of cuts. For a given number of cuts, the numerator 
function becomes the residue of one-loop integrand function minus the known contributions of terms with 
higher number of denominator factors. The residue of the one-loop integrand factorizes into a product of 
tree-level amplitudes (see e.g. Fig. EJ): 

Res 3ni ... 9nc (a™ (fi, . . . , f„ | %...nj) = [d gni {£) x • • • x d mr (£) x A™ (h, . . . , f„ \ £) 

= E (n^UU^+i)} > ( 51 ) 

gl— g c {k=l ) 

where the index k is cyclic (i.e. g c +i = gi) and gfc denotes the particles resulting from the cut lines. 

We can determine the parametric vector C gn ... 9n in Eq. (|50p by evaluating the right hand side for a 
set of loop momenta fulfilling the unitarity constraint of Eq. (|49[) . The only physics model input is given 
through the tree-level on-shell amplitudes, A4^°\ which we evaluate using Eqs. ([6]) and ([TTJ) . Two of the 
external lines to the on-shell tree-level amplitudes are generated by the /^-dimensional cut lines. These 
external states have in general complex, 5-dimensional momenta. This extension of the momenta does not 
modify the general structure of the tree-level level recursion relations discussed in the previous section. In 
this way we obtain a fully specified algorithm to determine the parameters and thereby the parametric form 
on the left hand side of Eq. . 

It is instructive to illustrate the structure given by Eq. (|46|) for a simple example. Let us consider the 
cut-constructible, D = 4, part of the box terms in 4-gluon scattering (n = 4, k = 4). In this case we have 
no pentagon terms and the numerator functions of the box terms are parametrized by two coefficients 



4 (^31923334 I "Pi fCffiS39452 I "Pi (^31343233 I 



where 



d gi (£)d gi2 [£)dg 123 (£)d gi234 {£) d gi (£)d gi3 (£)d gi34 (£)d gi342 (I) d gi (£)d gi4 (£)d gi42 (£)d gi423 (I) 

Pi {C qiq2 q 3 q 4 \ £j P4 (C qi q 3 q 4 q 2 \ lj P4 {C qi q 4 q 2 q 3 \ tj (52) 

d qi (£)d qi2 (£)d qi23 (£)d qi234 {£) d qi (£)d qi3 (£)d qi34 (£)dq 1342 (£) d qi (£)d qi4 (£)d qi42 (£)d qi423 (£) 

^4 (C hf2f3f4 I £) = Cf hhh + C« 2/s/4 x £-n; n, = e^^p^p^ . (53) 
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The parameters are calculated by using the residue formula of Eq. ([51]) . After the coefficients of the box 
functions have been obtained, one turns to calculate the coefficients of the triangle contributions. The 
numerator function for the triangle cut of the quark- loop contribution, Eq. (|50p . becomes 



934 



^nin 2 n 3 . 



= Res 



9192 934 



(„4 (1) (gi,g 2 ,g 3 ,g 4 | %n 2 n 34 ) 



929394 I ^nin 2 n 34 



Vi ( 



a 



91929493 I *-nin 2 n3 4 



^9l23(^nin 2 Il34) ^9124 (^nin 2 n 34 ) 

where the residuum of the quark loop can be calculated again using Eq. (|5ip . 
R-es giI j 2(?34 ^ (1) (gi,g 2 ,g 3 ,g 4 I ^ixnsiiaj) = 

= \d qi (£) x d qi2 (£) x d qi23i (£) x -A (1) (gi,g2,g 3 ,g4 I £) 

l J £-«n 1 n 2 n 34 

= •^ (0) (q t i I gi ) q2)xM (0) (qU2,q3)xx(°)(qt,g 3 ,g4,qi) 



(54) 



(55) 



qiq 2 q3 



Finally, we can obtain the one- loop amplitude, Eq. (|45[) . by integrating out the parametric forms on the 
right hand side of Eq. (|46|) over the loop momentum. In this way one finds the master-integral decomposition 
of the one-loop matrix element for every specified scattering configuration point ficl | : 



M 



(1) (fi,...,fn) = / ^.4 (i) (fi,...,f„m 



C ma> 



E E 



E s 



(fln 1 --Sn fc ) 



fc=l RP^ 



(1,2,. ...n) mt-'-guu 



{pgn 1 ■ 



■m^gn-L-m,, 



Cg ni ■■■9n fc ^SHi ■ ■9n s 



(56) 



where S^" 1 9rlfc ^ is the loop-integral symmetry factor (e.g. for a gluonic self-energy, the symmetry factor is 
^), the I gni .,, ga denote the scalar master- integral functions corresponding to the generalized cut given by 
the ordered partition list {IT • • -11^} and flavors of the cut lines (<?ni ■ ■ ■ 9n k )- The terms TZ gni ... gn are the 
leading terms of the higher dimensional scalar integrals in the limit D — > 4, 



h 1 /n 2 /n 3 /n 4 



^/n! /n 2 /n 3 



K 



fn 1 fn 2 



(k Ui -k u . 2 y 

6 



m 2 f + m 2 f 

tu 1 Jn 2 



n 



h 1 



= 



The scalar master integrals 



I fu 1 ---fn k = h ( K Ul , ■ ■ ■ ,Ku k ,m fni , 



> m /n. 



(57) 



(58) 



can be evaluated by e.g. using the numerical package developed in Ref. [4l[. In Eq. (|56")l the coefficients C 
and C are determined by applying Eqs. ([50)) and (fSTj) using a numerical algorithm. The C coefficients are 
generated due to the dimensional rcgularization procedure and are associated with the higher dimensional 
terms in the parametric forms. 
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4-2. Numerical Results for the Virtual Corrections of n-gluon Scattering 

We have applied the formalism of the previous sections to multi-gluon scattering. To this end we 
have extended the implementation presented in Ref. (30| . Three major changes are required to alter the 
generalized-unitarity based algorithm for the evaluation of color-ordered amplitudes to a numerical algo- 
rithm capable of calculating color-dressed one- loop amplitudes. First, in the decomposition of the one- loop 
integrands (cf. Eq. (3) of Ref. [3(| and Eq. |@B}), all sums over ordered cuts have to be changed into sums 
over partitions, which include all configurations obtained by non-cyclic and non-reflective permutations: 

E - E (59) 

[h\i h ] RP n ...„ t (l...n) 

Note that [h\ik] — 1 < h < h < ■ ■ ■ < h < n. Second, the tree-level amplitudes occurring in the 
determination of the integrand's residues have to be calculated from eolor-drcssed recursion relations. In 
addition, one not only has to sum over the internal polarizations of the gluons but also over their internal 
colors when computing these residues. Third, gluon bubble coefficients need to be supplemented by a 
symmetry factor of 1/2!. The appearance of the symmetry factor is associated with the parametrization 
ambiguity of the subtraction terms in the double cuts. For example, Eq. (|50[) gives for one of the double 
cuts in 4-gluon scattering 

?M4m334 I t) = R-CS Sl2 g 34 (U (1) (gi,g2,g3,g4 I i)) 

132 934 91334 I ^) ^ , 3(Cg3949i2 I ^) ^3 (C3433312 | £) 

d gi (i) d g2 (£) d g3 (i) d gi (£) 

92 93 94 9193 94 1 1) v 4 (c g 

192 94 93 I ^) ^4(C 929l g 4 g 3 I I) 

d gi (£)d gi23 (£) d g2 (£)d g213 (£) d gi (£)d gi2i (£) d g2 (£)d 321i (£) 

= R.es Sl2g34 ^ (1) (gi,g2,g3,g4 I 0) 

^3(^9192934 I I) _ ^3(^9192934 I -£ + K l+ K 2 ) 
d gi (£) d gi {-£ + K 1 +K 2 ) 

^3(^9394912 I Q _ ^3(^9394912 I -e + Ks + Kj 
d g3 (£) d 93 (-£ + K 3 + K 4 ) 

T ? i(C gig2g3gi I f) 7MC gig293g4 \-£ + Ki + K 2 ) 



d gi (£)d gi2s {£) d gi {-£ + K 1+ K 2 )d gi2a (-£ + K x + K 2 ) 
^4(^1923493 I ^(Cgigigigs I ~^ + K 3 + K A ) 



dgi W d si24 W d gi (-£ + K 3 + Ki)d gi2i {-I + K 3 + K A ) 



(60) 



We see that each of the four possible parametrized terms is subtracted twice with a different choice of the 
loop momentum. The symmetry factor of 1/2! "averages" over the double subtractions. 

The results of the new formalism can be tested thoroughly beyond applying the usual consistency checks 
such as solving for the master-integral coefficients with two independent sets of loop momenta. The value 
of the double pole (dp) can be cross-checked against the analytic result 

MJ,th = - ^nNcM^. (61) 

Moreover, for a given phase-space point, we can use the ordered algorithm of Ref. [3(| to compute the 
full one-loop amplitude of a certain color and helicity (polarization) configuration. Following the color- 
decomposition approach, we can analytically calculate the necessary color factors and sum up all relevant 
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Ordered cuts. 


# 


5-gon 


box 


triangle 


bubble 


sum 


sum„ 


total = sum x 


#orderings 


N(ab) k - 




n 


cuts 


cuts 


cuts 


cuts 




sum„_i 


#orderings 


= (n-l)!/2 


(n-2)! 




4 





1 


4 


6 


11 




33 


3 


2 


3 


5 


1 


5 


10 


10 


26 


2.36 


312 


12 


6 


7 


6 


6 


15 


20 


15 


56 


2.15 


3,360 


60 


24 


22 


7 


21 


35 


35 


21 


112 


2.00 


40,320 


360 


120 


40 


8 


56 


70 


56 


28 


210 


1.88 


529,200 


2,520 


720 


144 


9 


126 


126 


84 


36 


372 


1.77 


7,499,520 


20,160 


5,040 


756 


10 


252 


210 


120 


45 


627 


1.69 


113,762,880 


181,440 


40,320 


2,688 


11 


462 


330 


165 


55 


1012 


1.61 


1,836,172,800 


1,814,400 


362,880 




12 


792 


495 


220 


66 


1573 


1.55 


31,394,563,200 


19,958,400 


3,628,800 





Table 3: The number of cuts required for the calculation of one ordered n-gluon amplitude. The column labelled "total" gives 
the number of cuts when calculating all (n — l)!/2 ordered amplitudes needed to reconstruct the full virtual correction. The 
last two columns list the number of non-zero color- weight orderings for two special color configurations given in the text. 



orderings to obtain the full result. In particular, we have employed: 
M«( gl ,...,g n ) = E A{1) tl(9t,---,9t) 

P{2— n) 



- E 

P(l---n-l) 



int(n/2) n-k+1 n 

NcA v .. n + E E " E (-^^..^Al.^..^ 

fe=l mi = l m fc =m fc _i + l 



where 



m (1) (12---n) (62) 
(63) 



For example, 

•M (1) (gl,g2,g3,g4,g5) 



= E ^ (1) (gl>g2,g3,g4,g5) 



P(2345) 



E ^12345 — Ai A2345 — A2A1345 — A3A1245 — A4A1235 — A 5 A1234 



P(2345) 



A12A345 + A13A245 + A14A235 + A15A234 + A23A145 + A24A135 



A 25 A 13 4 + A 3 4A 12 5 + A3 5 A 12 4 + A 45 A 12 3 ) (12345) 



(64) 



Compared to the LO color-ordered decomposition, Eq. (|2"T)) . the NLO color-ordered decomposition leads 
to many subleading color factors. The number of one-loop ordered amplitudes with zero color weight is 
significantly smaller than the corresponding number for tree-level ordered amplitudes. As a result, the 
advantages of color dressing become more apparent at the one-loop level. 

For a more quantitative understanding of the one-loop amplitude decomposition, we respectively itemize 
in Tables [3] and 0] how many cuts need be applied to decompose the color-ordered and color-dressed one- loop 
integrands for n external gluons. In both cases we separately list the numbers of pentagon, box, triangle 
and bubble cuts and their sum. While for the ordered cuts these numbers are ruled by combinatorics: 
C(n,m) — (™) with m — 1, 5; in the unordered case they are given by the Stirling number^ of the second 



7 More exactly, the number of bubble cuts is given by 2"~ 1 — 1 — n = S2(n, 2) — n, since cuts that isolate one gluon do not 
contribute. For triangle, box, and pentagon cuts, we respectively have (3 n — 3 ■ 2 n + 3)/6 = So.(n, 3), 3 S2{n, 4) and 12 ^(n, 5) 
where, for the determination of the latter two, the recurrence relation S2{n, m) = S2(n — l,m — 1) + mS2 in — 1, m) is of help. 
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Unordered cuts. 


# 


pentagon 


box 


triangle 


bubble 


sum 


sum„ 


ordr total/unordr total 


n 


cuts 


cuts 


cuts 


cuts 


= total 


sum„_i 


orderings 


(ab) k 


(cd)h 


4 





3 


6 


3 


12 




2.750 


1.833 


2.750 


5 


12 


30 


25 


10 


77 


6.42 


4.052 


2.026 


2.364 


6 


180 


195 


90 


25 


490 


6.36 


6.857 


2.743 


2.514 


7 


1,680 


1,050 


301 


56 


3,087 


6.30 


13.06 


4.354 


1.451 


8 


12,600 


5,103 


966 


119 


18,788 


6.09 


28.17 


8.048 


1.610 


9 


83,412 


23,310 


3,025 


246 


109,993 


5.85 


68.18 


17.05 


2.557 


10 


510,300 


102,315 


9,330 


501 


622,446 


5.66 


182.8 


40.61 


2.708 


11 


2,960,760 


437,250 


28,501 


1,012 


3,427,523 


5.51 


535.7 


107.1 




12 


16,552,800 


1,834,503 


86,526 


2,035 


18,475,864 


5.39 


1699 


308.9 





Table 4: The number of cuts needed to calculate color-dressed n-gluon amplitudes. The last three columns give ratios of total 
numbers of cuts required to compute the virtual corrections in both the color-decomposition and color-dressed approaches. 
The first of these columns shows the ratios for all generic color orderings whereas the other columns show the ratios for two 
specific configurations as given in the text. 



kind, 52 (n, m), and therefore grow more quickly with n than those of the ordered cuts. This is exemplified in 
the "sum„/sum n _i" columns of the two tables. The growth factors slowly decrease for larger n, approaching 
the limit of 5 for the color-dressed case. As emphasized in Table |4] the pentagon-cut calculations dominate 
in this case over all other cut evaluations. The large-n growth of the total cut number is hence described by 
that of S2(n, 5) leading to the observed large-n scaling of 5 n . Using the color-decomposition approach, we 
have to deal with much fewer cuts per ordering. However, the total number of ordered cuts is obtained only 
after multiplying with the relevant number of orderings. When considering all possible (n— l)!/2 orderings, 
the final numbers are given in column "total" of Table [3l The last three columns show the number of generic 
orderings and the numbers AT of non-vanishing orderings (i.e. those having non-zero color factors) for two 
color configurations (ab) k = (13)(31)(11) . . . (11) and (cd) k = (22) (12) (23) (31) (11) (22) (33) (11) (22) ... H Of 
course, for a fair comparison between the ordered and dressed approach, the latter two columns are of 
higher interest, since zero color weights are not counted. Still, the ratios of total numbers of ordered versus 
unordered cuts is always larger than one as can be read off the last three columns of Table 2] Keeping in 
mind the greater cost of evaluating dressed recursion relations, the color-decomposition approach can be 
expected to outperform the dressed method as long as these ratios remain of order 0(1). This in particular 
is true for simple color configurations such as (cd)k- 

The analytic knowledge of A4^'(gi, . . . , g n ) presented in Eq. (f6"2")l enables us to perform stringent tests 
of our algorithm and its implementation. We consider 2 — > n — 2 processes where the gluons have possible 
polarization states G {+, — } and colors (ij)k where i k ,jk G {1, 2, 3} and k = 1, . . . , n, i.e. we make use 
of the color-flow notation. Our n-gluon results are given in the 4-dimensional helicity (FDH) scheme [40| ] . 
In almost all cases, we compare our new method labelled by "drss" with the color-decomposition approach, 
which - since it makes use of the ordered algorithm - we denote "ordr" . We will present all our results for 
two choices of loop- momentum and spin-polarization dimensionalities D and D s : the "4D-case" is obtained 
by setting D = D s = 4 and sufficient when merely calculating the cut-constructible part (ccp) of the one- 
loop amplitude. The "5D-case" specified by D = D s = 5 allows us to determine the complete result (all) 
including the rational part. In NLO calculations one identifies the momenta of the external gluons with 
those of well separated jets. We therefore apply cuts on the generated k — 1, . . . , n phase-space momenta 
(I = 3, ...,n): 

\m\ < 2, p x ,i > 0.1\E 1 +E 2 \, AR k i > 0.7, (65) 

where r\i and p±j respectively denote the pseudo-rapidity and transverse momentum of the l-th outgoing 
gluon; ARki describe the pairwise geometric separations in pseudo-rapidity and azimuthal-angle space of 



8 The first four colors are always fixed, supplemented by the repeating sequence (11) (22) (33) according to the number of 
gluons, i.e. for n = 5 we have (cd) k = (22) (12) (23) (31) (11), while forn = 9 we use (cd) k = (22)(12)(23)(31)(11)(22)(33)(11)(22). 
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4D-case 


5D-case 


n 


ordr 


drss 


ordr 


ordr 


drss 


ordr 




-W 

' n 


T (b) 
' n 


r n 


' n 


J>) 

' n 




drss 


' n 


r {b> 


r„ 


' n 


T tb) 


r„ 


drss 


4 


0.027 


0.026 




0.061 


0.062 




0.43 


0.053 


0.052 




0.139 


0.140 




0.38 


5 


0.159 


0.161 


6.04 


0.368 


0.364 


5.95 


0.44 


0.415 


0.412 


7.88 


1.026 


1.029 


7.37 


0.40 


6 


1.234 


1.235 


7.72 


2.152 


2.146 


5.87 


0.57 


3.887 


3.928 


9.45 


7.137 


7.124 


6.94 


0.55 


7 


12.07 


12.00 


9.75 


13.06 


13.08 


6.08 


0.92 


41.66 


41.61 


10.7 


49.62 


49.85 


6.98 


0.84 


8 


131.2 


131.3 


10.9 


80.22 


80.53 


6.15 


1.6 


493.2 


498.6 


11.9 


348.0 


346.9 


6.99 


1.4 


9 


1579 


1563 


12.0 


511.6 


507.8 


6.34 


3.1 


6316 


6296 


12.7 


2466 


2470 


7.10 


2.6 


10 


20900 


20480 


13.2 


3640 


3629 


7.13 


5.7 


88320 


88810 


14.0 


21590 


21620 


8.75 


4.1 



Table 5: Computer times r n in seconds obtained from the 4- and 5-dimensional evaluation of n-gluon virtual corrections at 
two random phase-space points a and b using a 3.00 GHz Intel Core2 Duo processor. The results are shown for both the color- 
ordered and color-dressed method. All virtual corrections were evaluated twice to check for the consistency of the solutions. 
The n gluons have colors (ab)^ and polarizations as specified in the text. Also given are the ratios r n = r n /T n -i where 
T n is the time to compute the correction for n gluons, in particular r n = (r„ + r^ b ')/2. The r n ratios of the ordered versus 
dressed method are depicted in the respective last column of the 4- and 5-dimensional case. 





4D-case 






5D- 


case 






4D/5D 


n 


ordr 


drss 


ordr 


drss 


ordr 


drss 




1 n 


T w 

> n 


r n 


' n 


r (b) 

' n 


r„ 


I n 


T w 

< n 


r„ 




r (b) 

' n 


r n 






4 


0.030 


0.030 




0.069 


0.070 




0.059 


0.059 




0.156 


0.157 




0.51 


0.44 


5 


0.180 


0.179 


5.98 


0.418 


0.413 


5.98 


0.464 


0.465 


7.87 


1.150 


1.148 


7.34 


0.39 


0.36 


6 


1.384 


1.383 


7.71 


2.419 


2.410 


5.81 


4.370 


4.340 


9.38 


8.036 


7.996 


6.98 


0.32 


0.30 


7 


13.53 


13.52 


9.78 


14.64 


14.65 


6.07 


46.65 


46.40 


10.7 


56.06 


55.99 


6.99 


0.29 


0.26 


8 


147.2 


147.5 


10.9 


90.48 


91.60 


6.22 


550.9 


549.5 


11.8 


395.2 


391.9 


7.02 


0.27 


0.23 


9 


1766 


1764 


12.0 


585.9 


585.0 


6.43 


7013 


7029 


12.8 


2844 


2845 


7.23 


0.25 


0.21 


10 


23100 


22830 


13.0 


4233 


4208 


7.21 


98760 


98360 


14.0 


24220 


24410 


8.55 


0.23 


0.17 



Table 6: Computer times r n in seconds for the same settings as used in Table [5] this time using a 2.66 GHz Intel Core2 Quad 
processor. The rightmost part of the table depicts the ratios of 4- versus 5-dimensional computer times for both approaches. 



gluons k and I. We perform a series of studies in the context of double-precision computations: we investigate 
the accuracies with which the double pole, single pole (sp) and finite part (fp) of the full one-loop amplitudes 
are determined by our algorithm. We also examine the efficiency of calculating virtual corrections by means 
of simple phase-space integrations. To begin with, we will verify the expected exponential scaling of the 
computation time for different numbers of external gluons. 

The scaling of the computer time can roughly be estimated by (/ x C max )™. The constants C max = 5 (4) 
and 1 < / < 4 express the fact that the number of pentagon (box) cuts and the exponential scaling with 
ri of the tree-level color-dressed recursion relation respectively govern the asymptotic scaling behavior of 
the unordered algorithm. Although one naively expects / — 4, this factor is reduced by the efficient re- 
use of gluon currents between different cuts. The C^ ax growth of the number of cuts reflects the large-n 
limit of the Stirling number S2{n 1 C max ). We show four tables summarizing our results for the computation 

times T n of obtaining A4^(gi, . . . , g„) = A4„\^k, (ij)k) by using two independent solutions of the unitarity 
constraints. The time for the re-computation has been included in t„. In real applications such a consistency 
check will become unnecessary, thereby halving the evaluation time per phase-space point. Table [5] lists the 
times obtained by running the 4- and 5-dimensional algorithms for the calculation of two random phase- 
space points labelled "a" and "b". The n gluons have colors (ij)k = (ab)k and alternating polarizations 
Afe = Kfe = H — . . . H — (+). Owing to the absence of pentagon cuts we find that the "4D-case" calculations 
are faster. More importantly, the computation time does not vary when the n-gluon kinematics changes. 
Hence, we can calculate the ratios r n — t„/t„_i by defining r„ = (t^ +t^)/2 and show these ratios in the 
table. While for the dressed algorithm these ratios are almost stable, they are larger and increase gradually 
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4D-case 


5D-case 


n 


ordr 


drss 


ordr 


ordr 


drss 


ordr 




1 71 


' n 


r n 






r n 


drss 


< n 


' n 


r n 


-("•*) 

' n 


-(««.) 

' n 


r n 


drss 


4 


0.049 


0.045 




0.074 


0.076 




0.63 


0.088 


0.085 




0.153 


0.155 




0.56 


5 


0.186 


0.185 


3.95 


0.364 


0.364 


4.85 


0.51 


0.479 


0.483 


5.56 


1.000 


1.000 


6.49 


0.48 


6 


1.186 


1.182 


6.38 


2.071 


2.068 


5.69 


0.57 


3.629 


3.586 


7.50 


6.805 


6.752 


6.78 


0.53 


7 


4.185 


4.277 


3.57 


11.82 


11.77 


5.70 


0.36 


14.02 


13.95 


3.88 


44.42 


44.46 


6.56 


0.31 


8 


27.12 


26.96 


6.39 


70.34 


71.10 


6.00 


0.38 


98.52 


99.13 


7.07 


294.8 


297.8 


6.67 


0.33 


9 


245.0 


242.9 


9.02 


443.8 


445.5 


6.29 


0.55 


960.3 


954.8 


9.69 


2080 


2070 


7.00 


0.46 


10 


1442 


1446 


5.92 


3265 


3270 


7.35 


0.44 


5943 


5968 


6.22 


18610 


18480 


8.94 


0.32 


11 








28670 


28690 


8.78 


















6 








2.044 




5.62 


















7 

8 








11.66 
68.85 




5.70 
5.90 


















9 








420.4 




6.11 


















10 








2972 




7.07 


















11 








26310 




8.85 


















12 










292000 


11.1 



















Table 7: Computer times t„ in seconds obtained for the color-ordered and color-dressed evaluation of n-gluon virtual corrections 
in 4 and 5 dimensions using a 3.00 GHz Intel Core2 Duo processor. Results are shown for two different polarization choices 
and Ky.. The virtual corrections were computed at the same random phase-space point with the n-gluon colors set to (cd)^. 
The choices are specified in the text. Ratios r n = T n /T n —1 arc given where r n = (r^ CTfe ' + r^ Kfc ')/2 is the time to evaluate the 
correction for n gluons two times. The re-computation is used to check both solutions for their consistency. The t„ ratios of 
the ordered versus dressed method are depicted in the respective last column of the 4- and 5-dimensional case. 



for the method based on ordered amplitudes. This reflects the (n — 2)1 factorial growth of the number of 
non- vanishing orderings of the color configuration (ab)k as given in Table [3J For the dressed approach, we 
find constant ratios of r n w 6 and r n w 7 in the "4D-case" and "5D-case" , respectively. This manifestly 
confirms our expectation of exponential scaling. The difference between the 4- and 5-dimcnsional ratios 
obviously arises because of the absence of pentagon cuts in the "4D-case" . The r 10 ratios do not fit the 
constant trend. We cannot exclude though that this is a consequence of the occurrence of large structures of 
maps to store the vast number of color-dressed coefficients. The increasing number of higher-cut subtractions 
terms may also cause deviations from the expected scaling, which we derived from our simple arguments 
stated above. Also, the conceptually easier way of storing all coefficients and calculating the largest-m cuts 
first is by far not the most economic in terms of memory consumption]^ For small n, the lower complexity 
of the ordered recurrence relation facilitates a faster calculation of the virtual corrections through ordered 
amplitudes. The turnaround appears for 7 < n < 8 and is just slightly above n — 7 for the "4D-case". 
With n > 8 the dressed method becomes superior owing to the different growth characteristics of the two 
approaches. This is neatly expressed by the "ordr/drss" ratios given in Table 

We have cross-checked the measured computation times in a different processor environment using ex- 
actly the same settings. The results are shown in Table [6] and consistent with those of Table [5l Instead 
of the "ordr/drss" ratios, here we list ratios comparing the 4- and 5-dimensional computation for both ap- 
proaches. They stress the relative importance of the pentagon-cut evaluations, which start to dominate the 
full calculation when n gets large. 

In Table[7]we detail computation times when varying the polarizations of the n gluons while keeping their 
color configuration fixed. We have chosen the two settings Xk = <Jk = + H — ... — and, as before, Afc — Kk- 
In terms of colors we consider the computationally less involved point (ij)k = {cd)k- Both amplitudes are 
calculated at the same random phase-space point "c" dissimilar from previous points "a" and "b" . For none 
of our four calculational options, we notice manifest deviations between the times r^ Afc ^ associated with the 



9 It is for this reason that our calculations are currently limited to n = 12 in the "4D-case" and n = 10 in the "5D-case". 
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4D-case 


5D-case 


4D/5D 


n 


ordr 


drss 


ordr 


ordr 


drss 


ordr 


ordr 


drss 




T n 


r n 




r„ 


drss 




r n 




Tn 


drss 






4 


0.026 




0.062 




0.42 


0.065 




0.151 




0.43 


0.40 


0.41 


5 


0.222 


8.54 


0.394 


6.35 


0.56 


0.615 


9.46 


1.139 


7.54 


0.54 


0.36 


0.35 


6 


1.863 


8.39 


2.378 


6.04 


0.78 


5.544 


9.01 


7.970 


7.00 


0.70 


0.33 


0.30 


7 


15.06 


8.08 


14.58 


6.13 


1.03 


50.41 


9.09 


56.94 


7.14 


0.89 


0.30 


0.26 


8 


129.2 


8.58 


93.09 


6.38 


1.39 


476.7 


9.46 


401.5 


7.05 


1.19 


0.27 


0.23 


9 


1127 


8.72 


603.6 


6.48 


1.87 


4483 


9.40 


2800 


6.97 


1.60 


0.25 


0.22 


10 


10980 


9.74 


3961 


6.56 


2.77 


50260 


11.2 


25140 


8.98 


2.00 


0.22 


0.16 



Table 8: Color-configuration averaged computation times r n in seconds obtained from the 4- and 5- dimensional color-ordered 
and color-dressed evaluations of n-gluon virtual corrections using 2.66 GHz Intel Core2 Quad processors. Results are shown 
for random phase- and color-space points and alternating gluon polarizations Aj. = Kfe, see text. The respective growth factors 
r„ = T n /r n _i are given where t„ denotes the time that is needed to calculate the n-gluon one-loop amplitude two times. The 
re-computation is used to check the two solutions for their consistency. Several time ratios are formed to compare the ordered 
with the dressed method and the 4- with the 5-dimensional computation. These ratios are displayed in the columns indicated 
accordingly. 



configuration: 


hard colors (ab)k 


simple colors (cd)k 


random non-zero colors 


fit values: 


a/10" 6 sec 


b 


a/10" 6 sec 


b 


a/10" 6 sec 


b 


4D, ordr 
5D, ordr 


1.91 
2.66 


9-75 j£» 

10.99 ±23 


34.5 
45.6 


c cc +0.32 
O-OO -0.30 

fi qq +0-29 
u.oy _ .28 


4.67 
7.84 


o r 7 +0.10 

-0.09 

9.46 ±° :\i 


4D, drss 
5D, drss 


39.4 
50.8 


fi 1Q +009 
D - iy -0.08 

7 oi +0.10 
' • z± -0.10 


28.2 
62.5 


-0.28 

6 92 + 08 

u - az -0.09 


38.7 
53.3 


6.30 +g;°* 

7 9 o +0.11 
' - zo -0.09 



Table 9: Parameter values a and b obtained from curve fitting of the computation times r n to the functional form of r n = ab n . 
The results are given for the three different n-gluon color assignments used in Tables \E\ (hard), [7] (simple) and [8] (random) and 
for all four algorithms the 4- and 5-dimensional color-ordered and color-dressed algorithm. 



two polarization settings. When inspecting the "ordr/drss" ratios, we observe that the ordered approach 
is advantageous in cases where only a few orderings contribute to the result of a certain point in color 
space. The fluctuations seen in the growth factors mirror the unsteady increase with n in non-zero orderings 
depicted in the last column of Table [31 For the dressed approach, we get similar, though somewhat smaller, 
growth factors compared to the previous test. In order to validate the dressed algorithm up to n = 12 
external gluons, we introduced a few more optimizations specific to the 4-dimensional calculations The 
lower part of Table [7] shows the computer times, which we obtained after optimization. They are consistent 
with our previous findings. As mentioned before, r„>io > 6 likely occur for reasons of increasingly complex 
higher-cut subtractions and computer limitations in dealing with large memory structures. 

For the calculation of the virtual corrections, one might question whether there exist enough points in 
color space that occur with many trivial orderings. If so, the color-decomposition based method would be 
more efficient on average. This is not the case for larger n as shown in Table[5J For gluon multiplicities of n — 
4, . . . , 10 and polarizations set according to k^, we list mean computation times, growth factors, "ordr/drss" 
and "4D/5D" ratios obtained for one- loop amplitude evaluations where the phase- and color-space points 
have been chosen randomly. Following the method outlined in Section 13.31 we only considered non-zero 
color configurations. We averaged over many events, for n = 4, . . . , 10 gluons, we used O(10 6 ), . . . , O(10 2 ) 
points. We observe that the pattern of the results in Table [SJ resembles that found in Tables [5] and [5] where 
we have studied the more complicated color point (ik)k — (ab)k- The ratios comparing the ordered and 



Some parts of the algorithm can be speed up when pentagon cuts are completely avoided. 
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Figure 7: Computation times r n versus the number n of external gluons for the three different gluon color assignments used 
in Tables \E\ (hard), [7] (simple) and [8] (random) . The results reported in these tables are shown for the 4- and 5-dimensional 
color-ordered and -dressed algorithms. The solid and dashed curves each represent the outcomes of the fits listed in Table [9] 
for both the dressed and ordered approach, respectively. 



dressed approach are smaller with respect to those of Tables [5] and [(5] This signals that the mean number 
of contributing orderings is somewhat lower than for the (ab)k case. We finally report dressed growth 
factors that are consistent with our previous findings confirming the approximate 6™ and 7 n growths in 
computational complexity of the new method for the 4- and 5-dimcnsional case, respectively. 

Using the results of Tables O [7J and [5] we have performed fits to the functional form r n = a b n . We 
show the outcome of the curve fittings in Table [5] Recall that the computation times have been obtained 
by using different color assignments for the n gluons. Tables [5] and [7] present results where we have chosen 
(ij)k — (a,b)k and (ij)k = (cd)k as examples of hard and simple color configurations, respectively. We 
have averaged over non-zero color settings to find the results of Table |8j Considering the performance of 
the dressed algorithm, we conclude that these data are in agreement with exponential growth for all color 
assignments. The errors on the fit parameter b are relatively small, only the 4-dimensional case of simple 
colors is somewhat worse because we included results up to n = 12 where parts of the computation become 
less efficient as explained above. The hard- and simple-colors case of the ordered approach show rather 
large errors for the &-parameter signalling that the genuine scaling law is not of an exponential kind in 
both cases. Interestingly, one observes an effective exponential scaling when averaging over many non-zero 
color configurations. The growth described by the 6-parameter is however a good two units stronger for the 
ordered approach than the growth seen in the color-dressed approach. To summarize, we have plotted in 
Fig. all computer times reported in Tables [5l [JJ and [5] as a function of the number of external gluons in 
the range 4 < n < 12. We have included in these plots the curves t„ = a b n , which we calculated from the 
respective fit parameters stated in Table [9j 
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Figure 8: Relative accuracies of the 1/e 2,1,0 poles of n = 6 gluon one-loop amplitudes as determined by the double-precision 

color-dressed algorithm. The gluon polarizations are given by = H 1 1 — , colors were chosen randomly among non-zero 

configurations. Vetoed events are included, only those with unstable ortho-vectors are left out, see text for more explanations. 
The mean accuracies and the number of randomly picked phase-space points are displayed in the top row and bottom left 
corner of the plot, respectively. 



In the following we will discuss the quality of the semi-numerical evaluations of A4n^ amplitudes for both 
the color-ordered and color-dressed approaches. To this end we analyze the logarithmic relative deviations 
of the double pole, single pole and finite part. Independent of the number n of gluons, we define them as 
follows: 

luWW M W I 2l7W (1)[1] -M {1)[2] I 

, K^dp.imra ~~ ^dp.thl , l yVa s/fp,nuro • /Vl s/fp,numl 

£dp = logi ° — i^n — ' £s/fp = logi ° imww \+\m w[2] i ' (66) 

l yV/l dp,thl K W s/fp,numl + l yV1 s/fp,num I 

where the structure of the double-poles M^ th is known analytically given by Eq. (foTj) . We use two 
independent solutions denoted by [1] and [2] to test the accuracy of the single poles and finite parts. All 
results reported here were obtained by using double-precision computations. We have run all our algorithms 
by choosing color configurations and phase-space points at random. Colors are distributed according to the 
"Non-Zero" method presented in Sec. 13.31 The phase-space points are accepted only if they obey the cuts, 
which we have specified at the beginning of this subsection. The gluon polarizations are always alternating 
set by Afc = Kk- Figure [H] shows the e distributions in absolute normalization, which we obtain from the 
5-dimensional color-dressed calculation for the case of n = 6 external gluons. The number of points used to 
generate the plots is given in the bottom left corner, the top rows display the means of the double-, single- 
pole and finite-part distributions. Limited to double-precision computations, we find that the numerical 
accuracy of our results for is satisfying. With e peak positions smaller than the respective mean values 

( £ d/s/fp) < ~8, we are able to provide sufficiently accurate solutions for almost all phase-space configurations. 
There is however a certain fraction of events where the single pole and finite part cannot be determined 
reliably. These 0(100) events occur because in exceptional cases small denominators, such as vanishing 
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n 


4D, ordr 


5D, ordr 


4D, drss 


5D, drss 


4 


1.0 


1.0 


1.0 


1.0 


5 


0.992 


0.991 


0.984 


0.984 (0.999) 


6 


0.960 


0.960 


0.964 


0.972 (0.994) 


7 


0.872 


0.873 


0.891 


0.892 (0.982) 


8 


0.635 


0.642 


0.829 


0.825 (0.953) 


9 


0.182 (0.84) 


0.205 (0.81) 


0.532 (0.93) 


0.533 (0.903) 


10 


0.0 (0.61) 


0.0 (0.50) 


0.38 (0.86) 


0.33 (0.83) 



Table 10: Fractions of n-gluon events that have a stable set of basis vectors in orthogonal space and also pass the veto on 
inaccurate master-integral bubble coefficients when using A ve to = 0.02. In brackets, fractions of n-gluon events that pass the 
test for unstable ortho-vectors. 



Gram determinants made of external momenta, cannot be completely avoided by the generalized-unitarity 
algorithms. We also see accumulation effects where larger numbers get multiplied together while determining 
the subtraction of higher-cut contributions. Owing to the limited range of double-precision calculations, such 
effects can lead to insufficient cancellations of intermediate large numbers that are supposed to cancel out 
even tuallyO The current implementation of the algorithm has no special treatment for these exceptional 
events. One either has to come up with a more sophisticated method treating these points separately or 
increase the precision with which the corrections are calculated. Both of which is beyond the scope of this 
paper and we leave it at vetoing these points. Yet, we need robust criteria that allow us to keep track of the 
quality of our solutions: we first test the orthonormal basis vectors that span the space complementary to 
the physical space constructed from the external momenta associated with the particular cut configuration 
under consideration. Failures in generating these basis vectors always lead to the rejection of the event0 In 
the example of Fig. [51 such events occurred with a rate of 0.6% and were not included in the plot. Secondly, 
and more importantly, we test the reliability of solving the systems of equations to determine the master- 
integral coefficients. To this end we generate an extra 4-dimensional loop momentum during the evaluation 
of the bubble coefficients establishing the cut-constructible part. Inaccuracies in solving for triangle etc. 
coefficients will be also detected, since at this level all higher-cut subtractions are necessary to obtain the 
correct value of the bubble coefficients. We use the extra loop momentum to individually re-solve for the 
cut-constructible bubble coefficient and compare this solution with the one obtained in first place. We veto 
the event, if the deviation A veto in the complex plane of the two solutions exceeds a certain amount. We fix 
the veto cut at A ve to = 0.02 for this publication. Having this cross-check at hand, we gain nice control over 
the events populating the tail of the accuracy distributions in Fig. [5J Applying the veto, we arrive at the 
distributions presented in the top left plot of Fig. HT1 where the steeper tails clearly demonstrate the effect of 
the veto. Certainly, both these shortcomings of imprecise ortho-vectors and inaccurately solved coefficients 
can be lifted by switching to higher precision whenever the respective double-precision evaluations have not 
passed our criteria. Accordingly, Table [TU] quantifies the fractions of events, which are within the scope of the 
color-dressed and color-ordered algorithms presented here. Owing to the more complicated event structures, 
the fraction of rejected events increases with n, where most of the events fail the bubble-coefficient test. We 
observe that the loss of events is more severe for the ordered algorithm. 

In the upper part of Figs. [51 1151 we show the distributions of relative accuracies e as occurring in the 
evaluation of gluon loop corrections with n — 4, . . . , 9 external gluons. The lower part of these figures and 
Figs. [TBI and [T71 themselves depict scatter graphs visualizing the relative accuracies as a function of the size 
of the virtual corrections for the single pole and finite contributions only, as the double pole contribution 
has no observable variance. This form of presenting the results has information on whether certain points 
dominate the uncertainty of the total correction when averaging over the phase space. The r- variables used 



11 More detailed explanations can be found in Ref. [30(1 . 

12 We test in particular whether the normalization of the orthonormal basis vectors deviates less than 10 -12 units from one. 
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Figure 9: Double-, single-pole and finite-part accuracy distributions (upper part) and scatter graphs (lower part) extracted 

from double-precision computations of one-loop amplitudes for n = N = 4 gluons with polarizations = H 1 — and randomly 

chosen non-zero color configurations. The virtual corrections were calculated at random phase-space points satisfying the cuts 
detailed in the text. Unstable solutions were vetoed. Results from the color-dressed algorithm are compared with those of the 
color-ordered method indicated by dashed curves and brighter dots in the plots. The 5(4)-dimensional case is shown in the top 
left (right) and center (bottom) part of the figure. The definitions of e and r are given in the text. All scatter graphs contain 
2 X 10 4 points. 
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Figure 10: Double-, single-pole and finite-part accuracy distributions (upper part) and scatter graphs (lower part) extracted 

from double-precision computations of one-loop amplitudes for n = N = 5 gluons with polarizations = H 1 h and randomly 

chosen non-zero color configurations. The virtual corrections were calculated at random phase-space points satisfying the cuts 
detailed in the text. Unstable solutions were vetoed. Results from the color-dressed algorithm are compared with those of the 
color-ordered method indicated by dashed curves and brighter dots in the plots. The 5(4)-dimensional case is shown in the top 
left (right) and center (bottom) part of the figure. The definitions of e and r are given in the text. All scatter graphs contain 
2 X 10 4 points. 
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Figure 11: Double-, single-pole and finite-part accuracy distributions (upper part) and scatter graphs (lower part) extracted 

from double-precision computations of one-loop amplitudes for n = N = 6 gluons with polarizations Aj. = H 1 1 — and 

randomly chosen non-zero color configurations. The virtual corrections were calculated at random phase-space points satisfying 
the cuts detailed in the text. Unstable solutions were vetoed. Results from the color-dressed algorithm are compared with 
those of the color-ordered method indicated by dashed curves and brighter dots in the plots. The 5(4)-dimensional case is 
shown in the top left (right) and center (bottom) part of the figure. The definitions of e and r are given in the text. All scatter 
graphs contain 2 X 10 4 points. 
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Figure 12: Double-, single-pole and finite-part accuracy distributions (upper part) and scatter graphs (lower part) as obtained 
from double-precision evaluations of one-loop amplitudes for n = N = 6 gluons with polarizations and colors set to Aj. = 

+ H and (ij)k = (12)(21)(13)(31)(11)(22), respectively. The virtual corrections were calculated at random phase-space 

points satisfying the cuts detailed in the text. The veto procedure has been applied to reject unstable solutions. The results 
given by the color-dressed algorithm are compared with those of the color-ordered method indicated by dashed curves and 
brighter dots in the plots. The 5(4)-dimensional case is shown in the top left (right) and center (bottom) part of the figure. 
The definitions of e and r are given in the text. Each scatter graph contains 2 X 10 4 points. 94.7(94.1)% and 91.2(91.0)% of 
the events pass all tests in the dressed and ordered "5(4)D-case" , respectively. 
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Figure 13: Double-, single-pole and finite-part accuracy distributions (upper part) and scatter graphs (lower part) extracted 

from double-precision computations of one-loop amplitudes for n = N = 7 gluons with polarizations A^ = H 1 1 h and 

randomly chosen non-zero color configurations. The virtual corrections were calculated at random phase-space points satisfying 
the cuts detailed in the text. Unstable solutions were vetoed. Results from the color-dressed algorithm are compared with 
those of the color-ordered method indicated by dashed curves and brighter dots in the plots. The 5(4)-dimensional case is 
shown in the top left (right) and center (bottom) part of the figure. The definitions of e and r are given in the text. All scatter 
graphs contain 2 X 10 4 points. 
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Figure 14: Double-, single-pole and finite-part accuracy distributions (upper part) and scatter graphs (lower part) extracted 

from double-precision computations of one-loop amplitudes for n = N = 8 gluons with polarizations Aj. = H 1 1 1 — 

and randomly chosen non-zero color configurations. The virtual corrections were calculated at random phase-space points 
satisfying the cuts detailed in the text. Unstable solutions were vetoed. Results from the color-dressed algorithm are compared 
with those of the color-ordered method indicated by dashed curves and brighter dots in the plots. The 5(4)-dimcnsional case 
is shown in the top left (right) and center (bottom) part of the figure. The definitions of e and r are given in the text; the 
number of points contained by each scatter graph is found in the lower left. 
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Figure 15: Double-, single-pole and finite-part accuracy distributions (upper part) and scatter graphs (lower part) extracted 

from double-precision computations of one-loop amplitudes for n = N = 9 gluons with polarizations A^=H 1 1 1 h 

and randomly chosen non-zero color configurations. The virtual corrections were calculated at random phase-space points 
satisfying the cuts detailed in the text. Unstable solutions were vetoed. Results from the color-dressed algorithm are compared 
with those of the color-ordered method indicated by dashed curves and brighter dots in the plots. The 5(4)-dimcnsional case 
is shown in the top left (right) and center (bottom) part of the figure. The definitions of e and r are given in the text; the 
number of points contained by each scatter graph is found in the lower left. 
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Figure 16: Single-pole and finite-part scatter graphs extracted from the double-precision computation of one-loop amplitudes 

for n = N = 10 gluons with polarizations = H 1 1 1 1 — and randomly chosen non-zero color configurations. 

The virtual corrections were calculated at random phase-space points satisfying the cuts as described in the text. Unstable 
solutions were vetoed and, therefore, not included in the plots. The upper (lower) row of plots shows the results obtained from 
the 5(4)-dimensional color-dressed algorithm. For the definition of r, see text. The number of points contained by each scatter 
graph can be found in the lower left. 
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and represent corrections of the order of a s . Specifically, the r, r' and rth given in the plots are obtained 
by employing M {1) = M ( ^ ] {p ^ num , M (1) = M [ ^ ] num and M (1) = M^ th , respectively. In all cases we 
have rejected events with unreliable basis vectors in orthogonal space. Except for the results presented in 
Fig. |T7l we have vetoed all events that led to unstable solutions of the bubble master-integral coefficient 
using A vcto = 0.02. The statistics concerning these rejections is shown in Table ITOl 



We compare in all plots of Figs. [9lfT5l the color-dressed with the color-ordered approach where the results 
of the latter are indicated by dashed curves in the spectra (with the (e) given by the lower top row of 
numbers) and brighter points in the scatter graphs. The e spectra of the "5D-case" ("4D-case") are always 
shown in the top left (right) parts of the figures; the associated scatter graphs are compiled in the center 
(bottom) parts. In Fig. \W\ we present our results for n — 10 gluons where for reasons of limited statistics we 
solely show the scatter graphs related to the dressed method. The veto procedure has a very strong impact 
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Figure 17: Double-, single-pole and finite-part scatter graphs visualizing the accuracy of double-precision evaluations of one- 
loop amplitudes for n = N = 9 and 10 gluons of alternating polarizations. Non-zero randomly chosen color configurations were 
used. Note that unstable solutions were not vetoed and therefore included in this presentation. The virtual corrections were 
calculated at random phase-space points satisfying the cuts as described in the text. Results of the 5-dimensional algorithms 
either based on color ordering (ordr) or color dressing (drss) are shown; for n = N = 9, the "4D-case" results are also given 
(ccp only). For the definition of r, see text; axis labels as used in Fig. I16l are understood. The rightmost graph contains O(20) 
points per e-pole, while the left plot of the "5(4)D-case" has approximately 50(120) points per pole. 



on A4g w calculations. For the purpose of direct comparisons between vetoed and non-vetoed samples, we 
have added in Fig. [T7] scatter plots that include vetoed events. 

In all cases we notice that the double poles are obtained very accurately with almost no loss in precision 
for increasing number of gluons. The n-dependence of the single-pole and finite-part precisions is not as 
stable as for the double pole. We see noticeable shifts of the peak and mean positions towards larger 
values when incrementing the number of external gluons. The distribution's tails are under good control. 
Because of the introduced veto procedure, they quickly die off around e « —2. In rare cases worse accuracies 
occur, which happens more frequently for the 5-dimensional calculations. We can avoid these cases, if we 
extend the veto criteria by re-solving for and testing the rational bubble coefficient as well. For n > 9, the 
limitations of double-precision computations unavoidably lead to rather unreliable single-pole and finite-part 
determinations. As an interesting fact, we observe that the color-dressed method yields throughout results 
of higher precision. Moreover, the decrease in accuracy for growing n is more moderate compared to the 
method based on color ordering. Clearly, on the one hand this algorithm has to be run for many orderings 
and may therefore lead to an accumulation of small imprccisions. On the other hand a rather inaccurate 
determination of may appear just for a single ordering, in turn spoiling the overall result. Both effects 
make the ordered approach less capable of delivering accurate results. Turning to the scatter plots, we 
find that the most accurate but also inaccurate evaluations occur for points distributed near the vertical 
line of 0(1) corrections. It is very encouraging that all top right quadrants are rather sparsely populated, 
dispelling the doubts that insufficiently determined large corrections may dominate our final results. The 
scatter regions of the double-pole solutions remain almost unchanged for larger n, while those of the single 
poles and finite parts are slightly growing gradually shifting towards lower relative accuracies. The scatter 
patches of the dressed method are displaced with respect to those of the color-decomposition approach: 
advantageously, they cover regions of greater precision, in particular populate the bottom right quadrants 
more densely. Due to the simplicity of the 4-gluon kinematics, the case of n = 4 gluons stands out from 
the rest: the single pole and finite part can be obtained with almost the same accuracy as the double pole. 
This feature is preserved even if rational-part calculations are included. With 5 gluons or more it is common 
that all coefficients contribute to the decomposition of the one-loop amplitude. The relative accuracies of 
the single poles and finite parts therefore develop a much different, less steeper, tail compared to the double 
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Figure 18: Finite-part versus single-pole accuracy (in double precision) as achieved in one-loop amplitude calculations using 

the color-dressed approach for various numbers n = N of external gluons with polarizations = H . . . H (+) and colors 

randomly chosen among non-zero configurations. Note that unstable solutions have not been vetoed. The n = N = 9 and 
n = N = 10 graphs only contain 1.6 ■ 10 3 and 87 points, respectively, whereas all other plots comprise 10 4 points. 

poles. There are almost no differences between the double- and single-pole results obtained from the 4- and 
5-dimensional algorithms. This is no surprise, since the coefficients necessary to reconstruct these poles can 
be determined in 4 dimensions and our algorithms have been set up accordingly. In the absence of rational- 
part calculations it turns out that the finite parts may on average be obtained slightly more precisely than 
the single poles. The tails of the 1/e spectra reach out to the largest e-values occurring in the evaluation of 
the cut-constructible part. The behavior is reversed in the 5-dimensional case owing to the addition of the 
rational part. For the same reason, we note increased (et p ) in the "5D-case", furthermore, the 5-dimensional 
scatter graphs show higher densities with respect to the 4-dimensional ones at lower accuracies. 
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As a special case of Fig. [TT] we have displayed in Fig. [T^] accuracy distributions and scatter plots for 
n = 6 gluons of polarizations = + H when instead of random color-space points the fixed non- 
zero color configuration (ij)k = (12) (21) (13) (31) (11) (22) has been selected. We notice that all e-spectra 
are shifted towards smaller accuracies. Also, as illustrated by the scatter graphs, the magnitude of the 
virtual corrections is bound at 0(1) with the exception of the finite piece of the cut-constructible part of 
the one- loop amplitudes. Interestingly, this is corrected back by adding in the rational part. 

In Ref. [30| | it was shown that the finite-part accuracy of the evaluation of ordered amplitudes is mostly 
correlated with that of the single poles. We have studied this issue for the dressed algorithm in the "5D-case" . 
The corresponding scatter plots also include the vetoed events and are presented in Fig. [18] The multitude 
of points is distributed along the diagonal indicating a strong correlation. As for color-ordered amplitudes 
the evaluation of the rational part becomes more involved with increasing gluon numbers. Therefore, regions 
of lower finite-part precision start to get populated distorting the diagonal trend. 

Finally, we want to show that the Monte Carlo sampling as defined in Eq. (j4"4")l converges sufficiently fast 
for the color-dressed calculated virtual corrections. To this end we generalize the LO discussion following 
Eq. (j35|) with the details given in Sec. 13.31 The relevant quantity to explore in the Monte Carlo averaging is 
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where we choose as = 0.12 and M.^ \ is the finite part of the virtual corrections. The sum over the -ZV co i p t s 
color configurations for each phase-space point is an optional "mini-Monte Carlo" over colors for faster 
convergence as a function of the number of phase-space point evaluations. By adding the real corrections 
to Eq. (|68p and performing the coupling constant renormalization and mass factorization, one obtains the 
gluonic contribution to the NLO multi-jet differential cross section. Therefore, the convergence of Eq. (|68[) 
is the relevant quantity to study. 



By defining the ra-gluon color-summed counterpart of Si 
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we can form the ratios 



i) - 



(S 



MC 



Wcol 



/c.(0+l) 
WMC 



± <7,„(o+i), 
Wmc ) 



(0) } 



(70) 



analogously to Eq. (f3~7| . We define the mean values and standard deviations of the ratios similarly to 
Eqs. (|3"5j) and (f3"9"]> . respectively. Note that is already defined at LO by Eq. (|5d| . As we increase 
the number of Monte Carlo points, Nmc, the R^ -ratios quantify the relative importance of the virtual 
corrections, while the i?( 0+1 )-ratios should converge to one. For the latter, this is nicely demonstrated in 
Fig. HH for the 4-gluon virtual corrections and the "Non-zero" sampling scheme as described in Sec. 13.31 
After 15900 events we obtain = 0.939 ± 0.039, which is satisfactory for this consistency check. 

As in the LO discussion we want to illustrate how many events are needed to achieve a certain relative 
integration uncertainty when performing the Monte Carlo color sampling. In analogy to Eq. (|40j) we can 
construct the ratio 
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Figure 19: Consistency test for the Monte Carlo integration of 4-gluon virtual corrections using "Non-Zero" color sampling 
compared to the exact color summing. As a function of the number JVmc of evaluated phase-space points the -ratio is 

plotted converging to one as it should be. The inserted plot shows the number of phase-space evaluations needed to reach a 
given relative accuracy in terms of fiJ^J 1 ' (JVmcO while Monte Carlo integrating; for the definitions, see text. The dashed line 
depicts the fit function cr/fi = A , see also Table 1111 



in order to study the impact of the virtual corrections. As before we partition AT event = Atrial x JVmc events 
to have a certain number of trials to compute the corresponding mean values fi and standard deviations a for 
n-gluon LO and virtual scattering according to Eqs. (|4ip and (|42|) . respectively. For the case of /^^'(JVmc) 
and 4-gluon scattering, the number of Monte Carlo points versus a given relative accuracy is shown in the 
inlaid plot of Fig. []j|J As at LO, the curve bends behaving as statistically determined after a certain amount 
of Monte Carlo integration steps. 

To quantify the color- integration performances, we again perform fits to the functional form A x 
and show the values of the fitted parameters in Table [TT] for the various cases. As argued in Sec. I3.3l for large 
enough Nmc, we expect a scaling of a//j, that is proportional to 1/v^Vmc- The goodness of the sampling 
schemes is signified by the A- and A'-parameters, where the latter is more important since the time factors 
are included. Smaller values of these parameters indicate a better efficiency of the sampling procedure. 

Using the i?( v ) and R^q(Nuc) ratios, we summarize in Figs. our Monte Carlo integration results 

for n — 4, . . . , 7 gluon processes and for the various color-sampling schemes. The upper graphs display 
the averaging of S^ 1 ' normalized to the Monte Carlo average of the color-summed LO contribution as 
a function of the number of phase-space evaluations^ We also indicate the estimate of the integration 
uncertainty, see Eqs. ([70)1 and (|3T)|) . To compare all different test cases, Table [T21 list the final values for 
R<y\ In all these fi gures we plot in the lower graphs the number of phase-space point evaluations needed 



13 As for the LO studies in Sec. 13.31 the gluon polarizations are taken alternating and remain fixed while performing the 
Monte Carlo integrations. 
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0.497 


22.0 
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5.41 


17.0 


10.4 


0.476 


3.57 


13.5 


16.4 


0.485 


2.05 


15.6 


65.7 


5 


0.482 


59.4 


0.454 


13.3 


43.3 


13.5 


0.442 


9.71 


36.4 


19.8 


0.439 


5.56 


37.8 


79.2 


6 


0.325 


7.08 


0.344 


5.37 


14.0 


16.3 


0.255 


1.60 


3.50 


21.7 


0.233 


0.850 


2.14 


87.6 



Table 11: Parameter values B, A and A' obtained from curve fitting of the ct(Rmc) I M(-^Mc) 1° the functional form A X iV Mt -, . 
The results are given for the different ways of sampling over colors in n-gluon scattering. The 4-gluon case marked by "*" 
corresponds to the consistency check shown in Fig. 1191 where has been considered. In all other cases R^c has been 

used, cf. Figs. l20l 121 1 and 1221 Note that for n = 6, we have fitted <j(rQq). The parameters A' = Af B take into account that 
the evaluation of a fixed number of Monte Carlo events takes longer for the other than "Naive" color-sampling methods. The 
time factors / relative to the "Naive" case are also displayed. 
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4 TO 6 


0.4739 ± 0.0054 


4 -10 6 


0.4750 ±0.0017 


4 TO 6 


0.4724 ± 0.0013 


1 TO 6 


0.4738 ± 0.0020 
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631K 


0.241 ± 0.022 


631K 


0.2673 ± 0.0072 


631K 


0.2744 ± 0.0058 


160K 


0.2790 ± 0.0058 


6 


64K 


-0.10 ±0.12 


64K 


-0.059 ± 0.094 


50.2K 


-0.076 ± 0.062 


16K 


-0.044 ± 0.066 


7 


4K 


-0.87 ±0.66 


4K 


-0.23 ±0.09 


4K 


-0.14 ±0.10 


2K 


-0.97 ±0.65 



Table 12: Monte Carlo integration results for the ij( v ) ratios as defined in the text after %q phase-space point evaluations 
for n-gluon scattering and different color-sampling schemes using color-dressed tree-level and one-loop amplitude calculations. 



to reach a certain relative integration uncertainty on R^^Nmc)- We show in Table [TT] the results of the 
curve fittings represented by the dashed lines in these plots. 

As is clear from these Monte Carlo averaging tests and results, the convergence is more than satisfactory 
for future applications of the color-dressing techniques in NLO calculations. If faster sampling convergence 
is required we can evaluate multiple color configurations per phase-space point. This is shown in the graph, 
where we have chosen to evaluate four color configurations at one phase-space point. 

5. Conclusions 

In this paper we explored the possibility of color sampling within the context of D-dimensional generalized 
unitarity. Up to now generalized unitarity has only been used within the context of color-ordered primitive 
amplitudes. In the color-ordered approach, color is treated differently from the other quantum numbers 
such as spin and flavor. This makes the reconstruction of the full one-loop amplitude rather cumbersome. 

We have reformulated the D-dimensional generalized unitarity formalism to include color dressing. That 
is, we choose the explicit color of each parton, together with all other quantum numbers, for each Monte 
Carlo event. In this way all particles, colored or colorless, are treated on an equal footing. There is no 
distinction between different particles as far as the formalism goes. Consequently, the resulting algorithm is 
independent of the type and flavor of the external particles. E.g. the same algorithm calculates the 6-gluon 
virtual corrections, the 6-photon virtual corrections and the ^±6 parton virtual corrections. 

The use of unordered amplitudes requires the partition of the external legs into unordered subsets. This 
is necessary for the calculation of the tree-level amplitudes as well as for generating all the unitarity cuts. 
As a result the complexity of the resulting algorithm is exponential. That is, the computer time needed 
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Figure 20: Upper graph: convergence of the 4-gluon virtual corrections integration as a function of the number of evaluated 
phase-space points. Also shown is the standard deviation as an estimator of the integration uncertainty. Lower graph: 
convergence of the Monte Carlo integration, where the relative integration uncertainty is shown as a function of the number of 
phase-space evaluations. The dashed lines describe the fit functions a j [i = A , see also Table lTTl The "Naive" , "Conserved" 
and "Non-Zero" color-sampling methods are explained in Sec. 13.31 The points indicated by "Non-Zero, A r co i pta= 4" average over 
4 color configurations per phase-space point. 
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Figure 21: Upper graph: convergence of the 5-gluon virtual corrections integration as a function of the number of evaluated 
phase-space points. Also shown is the standard deviation as an estimator of the integration uncertainty. Lower graph: 
convergence of the Monte Carlo integration, where the relative integration uncertainty is shown as a function of the number of 
phase-space evaluations. The dashed lines describe the fit functions a j [i = A , see also Table lTTl The "Naive" , "Conserved" 
and "Non-Zero" color-sampling methods are explained in Sec. 13.31 The points indicated by "Non-Zero, A r co i pta= 4" average over 
4 color configurations per phase-space point. 
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Figure 22: Upper graph: convergence of the 6-gluon virtual corrections integration as a function of the number of evaluated 
phase-space points. Also shown is the standard deviation as an estimator of the integration uncertainty. Lower graph: 
convergence of the Monte Carlo integration, where this time the standard deviation is shown as a function of the number of 
phase-space evaluations. Note that for this case, the virtual corrections are as large as the LO contribution so that the full 
result is close to zero. The dashed lines describe the fit functions a = AN^^, see also Table [TT] The "Naive", "Conserved" 
and "Non-Zero" color-sampling methods are explained in Sec. 13.31 The points indicated by "Non-Zero, A r co i pts= 4" average over 
4 color configurations per phase-space point. 
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Figure 23: Upper graph: convergence of the 7-gluon virtual corrections integration as a function of the number of evaluated 
phase-space points. Also shown is the standard deviation as an estimator of the integration uncertainty. Lower graph: 
convergence of the Monte Carlo integration, where for this case, only the standard deviation is shown as a function of the 
number of phase-space evaluations. The "Naive", "Conserved" and "Non-Zero" color-sampling methods are explained in 
Sec. 13.31 The points indicated by "Non-Zero, N co i pts —4" average over 4 color configurations per phase-space point. 
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to calculate the virtual corrections grows with a constant multiplicative factor when one adds external 
particles. In addition, we have to sum over all color states of the internal lines. One may conclude from 
these general features that the implementation of the color-dressed D-dimensional generalized unitarity is 
less efficient in comparison with an implementation based on ordered primitive amplitudes. As we have 
explicitly demonstrated for the example of calculating the virtual corrections to n-gluon scattering, this is 
not the case. We compared the color-sampling approach for both the color-ordered and color-dressed case. 
The calculation of the virtual corrections in the color-dressed case scales as 7™, while in the color-ordered 
case the effective scaling up to 10 gluons behaves as 9". Moreover, the color-dressed calculation has a better 
accuracy in calculating the value of the one-loop amplitude. The improved accuracy over color-ordered 
evaluations increases with n. 

As we showed for n-gluon scattering, the color-dressed approach becomes more efficient than the color- 
ordered method for large n. One could argue that the differences are small and color sampling over the 
ordered n-gluon amplitudes will work as well. However, when including quarks and other electro-weak 
particles the color-dressed approach will easily win out over the color-ordered approach. This is because any 
notion of primitive amplitudes is absent. The algorithm simply calculates the virtual correction. Moreover, 
the color-dressed algorithm remains identical when including quarks and electro-weak particles. It is this 
algorithmic simplicity that will enable us to employ parallel programming to significantly improve the 
computer evaluation time. 

We conclude that the color-dressed formulation is competitive for calculating one-loop virtual corrections 
for n-gluon scattering. It is expected that it will be even more efficient in calculating virtual corrections for 
processes involving quarks and electro- weak gauge bosons in addition to the gluons. 
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A. The Tree-Level 6-Quark Amplitude 



As an example we can take a few recursive steps in calculating the 6-quark tree-level matrix element. 
We start with the definition of the tree-level matrix element in terms of the 5-quark fcrmionic current 



M (0) (u,u,d,d,s,s) = P- 1 [j(u,u,d,d,s) , J (a)] 



(73) 



where we use the shorthand notation u = u^ 1 {K{), u = uj* 1 (K 2 ), d = d^ (K 3 ), d = dj* 2 (K 4 ), s = s£ (K 5 
and s = sJ 3 X:s (K 6 ). The 5-quark fermionic current decomposes into 



J- s (u, u, d, d, s) = P g [^[j(d,d,s) , J(u,u)] 

+ P s [d[J(s), J(u,u,d,d)] 
The 3-quark fcrmionic current decomposes into 



D[J(u,u,s),J(d,d)] 



Js(q,q,s) = Ps 



£>[j(s),J(q,q); 



(74) 



(75) 



where q G {u,d} and q € {u, d}. The 1-quark fermionic current is simply the source term. Finally the 
4-quark gluonic current is given by 



Jg (u, u, d, d) 



Pe 



+ P, 



D[J(u),J(d,d,u)] 
D[J(d),J(u,u,d)]' 



D[J(u,d,u),J(u)] 
D[J(d,u,u),J(d)] 



(76) 



4G 



and the 2-quark gluonic current is written as 



J g (q,q) = P s £>[j(q),J(q)] 



(77) 



The above steps define the 6-quark LO amplitude recursively as would be done by the algorithm. Note that 
we have ignored all flavor violating currents. 



B. The Implemented Gluon Recursion Relation 

Making use of the color-flow representation [32[, we define the color-dressed gluon currents as 3 x 3 
matrices of ordered gluon currents: 

J^Hqi 1 ) = ^/jJM x ) > (78) 

where the external gluon g\ has the polarization Ai and four- momentum K\, its colors are denoted by (ij)i- 
The color-flow labels of the dressed current are (IJ) and /i indicates the Lorentz label. Using this definition, 
the connection to the compact notation introduced in Sec. 13. H is found as 

J g (gi) - S^S^e^iKi) = Sjtfj^) = 4 J %^) ■ (79) 

Since we only consider gluons, a plain numbering of the external particles = {gk, Xk, (ij)ki Kk} is sufficient 
and helps simplify the notation such that the color dressing becomes more emphasized. Hence, in all what 
follows we write J g (l) = 5^ S] 1 J p ( 1 ) = jj/ (1). Dressed n- gluon currents are then described by 

Jj{ j \l, 2, . . . , n) = J2 5 LjZl ■ ■ ■ S iV 5 J n • • • , o-n) , (80) 

o-GS„ 

which follows as a consequence of the color decomposition of the tree-level amplitude into ordered ones: 

M^(l,2,...,n,n + 1) = £ ^ 5 tl ' ' ' 5 tV 5 tli . . . , a n , n + 1) . (81) 

o-GS„ 

The vectors a describe the elements of the permutations S n of the set {1,2, ... ,n}. With the color-ordered 
amplitudes m^°\ai, 02, . . . , <7 n , n + 1) expressed through ordered J-currents and the definition of the dressed 
currents at hand, we can re-write the last equation and formulate the tree-level amplitude in terms of the 
color-dressed currents: 

M^(l,2,...,n,n+l) = A:,., „ ; x 

<?es n 

= K\ia,..M f ) (M 1 ...,n)J m "(n + l) . (82) 

Owing to the simple color structure of the one-gluon current, the summation over the color indices {I J) 
effectively reduces to the calculation of a single scalar product of the ordered currents jj^ n + 1 ^ n + 1 > anc j 
jO'n+i The invariant-mass prefactor K 2 is determined by the gluon momenta via K% 2 n j = 

(K\ +K2 + ■ ■ ■ + K n ) 2 - The one-gluon current is given in Eq. 1(75]). while the multi-gluon current is obtained 
recursively. Starting from Eq. (|80p . one incorporates the ordered gluon recurrence relation to evaluate 
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J/i(<Ti, . . . , a n ) and re-groups accordingly to identify the partitioning. After some algebra, one finds 



J"(l,2,...,n) = K 



{1,2,. .,«.} 



srILN X INL 



E( r/J 
{"KM J ~ U MKJ 

P„ 17I2 (l,...,n) 



EfxILNP , r 
\°KMOJ + °i 



cILPN r 
OMKJ ~ °KOMJ 



INPL 
MOKJ 



{4 KL) M,4 MN H^),4 OP) (^)} 



7Tl <-> 7T 2 



(83) 



where we have employed the bracket notation for ordered-current operations, which was introduced in 
Ref. [20]. The partition sums are explained in Sec. 13. f I and an implicit summation over the color indices 
K, L, M, N, O, P is understood. To efficiently compute the dressed currents, the color factors in front of the 
operator brackets can be pre-calculated such that the computation of zero color-weight contributions can 
be avoided. We have used the shorthand notation 



°jl-n — °n 



(84) 

The recursion relation presented in Eq. ((83]) scales asymptotically as 4™, since we kept the 4-gluon vertex as 
an entity in our calculation. As a consequence we have to evaluate 3-subset partitions and the corresponding 
curly brackets that merge three different dressed currents. 
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